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Description 

FIELD OF THE INVENTION 

5 [0001] The present Invention relates to polynucleotides encoding novel polypeptides, polypeptides encoded by the 
polynucleotides, and new uses of these. 

BACKGROUND OF THE INVENTION 

10 [0002] Currently, the sequencing projects, the detemilnatlon and analysis of the genomic DNA of various living or- 
ganisms have been in progress all over the world. The whole genomic sequences of more than 40 species of prokary- 
otes, a lower eui<aryote, yeast, a multicellular eukaryote, C. elegans, and a higher plants, arabidopsis, etc. are already 
determined. For human genome, presumably having 3 billion base pairs, the analysis was advanced under global 
cooperative organization, and a draft sequence was disclosed In 2001. Moreover, all the structures are to be clear and 

15 to be disclosed in 2002 - 2003. The aim of the determination of genomic sequence is to reveal the functions of all genes 
and their regulation and to understand living organisms as a network of Interactions between genes, proteins, cells or 
individuals through deducing the information In a genome, which Is a blueprint of the highly complicated living organ- 
isms. To understand living organisms by utilizing the genomic Information from various species is not only important 
as an academic subject, but also socially significant from the viewpoint of industrial application. 

20 [0003] However, determination of genomic sequences Itself cannot Identify the functions of ail genes. For example, 
as for yeast, only the function of approximately half of the 6000 genes, which Is predicted based on the genomic 
sequence, was able to be deduced. On the other hand, the human genome has been estimated to contain about 
30,000-40,000 genes. Further, 1 00,000 or more types of mRNAs are said to exist when variants produced by alternative 
splicing are taken Into consideration. Therefore, it Is desirable to establish "a high throughput analysis system of the 

25 gene functions" which allows us to identify rapidly and efficiently the functions of vast amounts of the genes obtained 
by the genomic sequencing. 

[0004] Many genes in the eukaryotic genome are split by introns Into multiple exons. Thus, It is difficult to predict 
correctly the structure of encoded protein solely based on genomic information, in contrast, cDNA, which is produced 
from mRNAthat lacks introns, encodes a protein as a single continuous amino acid sequence and allows us to Identify 

30 the primary structure of the protein easily In human cDNA research, to date, more than three million ESTs (Expression 
Sequence Tags) are publicly available, and the ESTs presumably cover not less than 80% of all human genes. 
[0005] The information of ESTs is utilized for analyzing the structure of human genome, or for predicting the exon- 
regions of genomic sequences or their expression profile. However, many human ESTs have been derived from prox- 
imal regions to the 3'-end of cDNA, and Information around the 5'-end of mRNA Is extremely little. Among human 

35 cDNAs, the number of the con'esponding mRNAs whose encoding full-length protein sequences are deduced is ap- 
proximately 13,000. 

[0006] It is possible to Identify the transcription start site of mRNA on the genomic sequence based on the 5'-end 
sequence of a full-length cDNA, and to analyze factors Involved in the stability of mRNA that is contained in the cDNA, 
or in its regulation of expression at the translation stage. Also, since a full-length cDNA contains atg codon, the trans- 

40 iation start site, in the5'-region, it can be translated into a protein in a correct frame. Therefore, It Is possible to produce 
a large amount of the protein encoded by the cDNAorto analyze biological activity of the expressed protein by utilizing 
an appropriate expression system. Thus, analysis of a full-length cDNA provides valuable information which comple- 
ments the Information from genome sequencing. Also, full-length cDNA clones that can be expressed are extremely 
valuable in empirical analysis of gene function and in industrial application. 

45 [0007] Therefore, if a novel human full-length cDNA is isolated, it can be used for developing medicines for diseases 
in which the gene is involved. The protein encoded by the gene can be used as a drug by itself. Thus, It has great 
significance to obtain a full-length cDNA encoding a novel human protein. 

[0008] In particular, human secretory proteins or membrane proteins would be useful by itself as a medicine like 
tissue plasminogen activator (TPA), or as a target of medicines like membrane receptors, in addition, genes for signal 

50 transduction-reiated proteins (protein kinases, etc.), glycoprotein-related proteins, transcription-related proteins, etc. 
are genes whose relationships to human diseases have been elucidated. Moreover, genes for disease-related proteins 
form a gene group rich in genes whose relationships to human diseases have been elucidated. 
[0009] Therefore, it has great significance to Isolate novel full-length cDNA clones of human, only few of which has 
been isolated. Especially, isolation of a novel cDNA clone encoding a secretory protein or membrane protein is desired 

55 since the protein Itself would be useful as a medicine, and also the clones potentially Include a gene involved in diseases, 
in addition, genes encoding proteins that are involved in signal transduction, glycoprotein, transcription, or diseases 
are expected to be useful as target molecules for therapy, or as medicines themselves. These genes forni a gene 
group predicted to be strongly involved in diseases. Thus, identification of the full-length cDNA clones encoding those 
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proteins has great significance. 

SUMMARY OF THE INVENTION 

5 [0010] An objective of the present invention is to provide polynucleotides encoding novel poiypeptides, polypeptides 
encoded by the polynucleotides, and novel usages of these. 

[0011] The inventors have developed a method for efficiently cloning, from a cDNA library having very high fullness- 
ratio, a human full-length cDNAthat is predicted to be a full-length cDNA clone, where the cDNA library is synthesized 
by an improved method (WO 01/04286) of the oligo-capping method (K. MaruyamaandS. Sugano, Gene, 138: 171-174 

10 (1994); Y, Suzuki et al., Gene, 200; 149-156 (1997)). Then, the nucleotide sequences of cDNAciones whose fullness 
ratio is high, obtained by this method, were determined mainly from their 5'-ends, and, if required, from 3'-ends. 
[0012] Further, representative clones, which were estimated to be novel and full-length, among the clones obtained, 
were analyzed for their full-length nucleotide sequences. The determined full-length nucleotide sequences were ana- 
lyzed by BLAST homology search of the databases shown below. Because the homology search of the present inven- 

15 tion is carried out based on the infonnation of full-length cDNAs including the entire coding regions, homology to every 
part of a polypeptide can be analyzed. Thus, in the present invention, the reliability of homology search has been 
greatly improved. 

[1] SwissProt 

20 (http;//www.ebi. ac.uk/ebi_docsSwissProt_db/swisshome.html), 

[2] GenBank (http://www.ncbi.nlm.nih.gov/web/GenBank), 
[3] UnlGene (Human) (http;//www.ncbi. nlm.nih.gov/UniGene), and 

[4] nr (a protein database, which has been constructed by combining data of coding sequences (CDS) in nucleotide 
sequences deposited in GenBank, and data of SwissProt, PDB (http://www.rcsb.org/pdb/index.html), PIR (http:// 
25 pir.georgetown.edu/pirwww/pirhome.shtml), and PRE (http://www.prf.or.jp/en/); overlapping sequences have 

been-removed.) 

[0013] Further, the gene expression profiles of cDNAciones whose full-length nucieotide sequence had been deter- 
mined were studied by analyzing the large-scale cDNA database constructed based on the 5'-end nucieotide sequenc- 
30 es of cDNAs obtained. In addition to the analysis for the expression profile by computer, the profiles of gene expression 
in iiving cells were also determined by PGR. The present inventors revealed the usefulness of the genes of the present 
invention based on these analysis results. 

[0014] In the present invention, gene functions were revealed by the analysis of expression profiles In silico based 
on the Information of full-length nucleotide sequences. The expression profiles used In the expression frequency anal- 

35 ysis were studied based on the database containing sufficient amount of fragment sequence data. The expression 
frequency analysis was carried out by referring, for these expression profiles, to the full-length nucleotide sequences 
of many cDNA clones obtained in the present invention. Thus, a highly reliable analysis can be achieved by referring 
to the fuii-length nucleotide sequences of a wide variety of genes for the sufficientiy large population for analysis (ex- 
pression profiles). Namely the results of expression frequency analysis using the full-length sequences of the present 

40 invention more precisely reflect the gene expression frequency in tissues and cells from which a certain cDNA library 
was derived. In other words, the information of full-length cDNA nucleotide sequence of the present invention made it 
possible to achieve the highly reliable expression frequency analysis. 

[0015] The full-length cDNAciones of this invention were obtained by the method comprising the steps of [1] preparing 
libraries containing cDNAs with the high fullness ratio by oligo-capping, and [2] assembling 5'-end sequences and 

45 selecting one with the highest probability of completeness in length in the cluster formed (there are many clones longer 
in the 5'-end direction). However, the uses of primers designed based on the 5'- and 3'-end sequences of polynucle- 
otides provided by the present invention enable readily obtaining full-length cDNAs without such a special technique. 
The primer, which is designed to be used for obtaining cDNAs capable of being expressed, is not limited to the 5'- end 
3'-end sequences of polynucleotide. 

50 [0016] Specifically, the present invention relates to a polynucleotide selected from the group consisting , of the fol- 
lowing (a) to (g): 

(a) a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID NOs 

shown in Table 1; 

55 (b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs shown 

in Table 1; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid se- 
quence of any one of SEQ ID NOs shown in Table 1 , wherein, in said amino acid sequence, one or more amino 
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acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes a 
polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing under stringent conditions to a polynucleotide comprising the nucleotide sequence 
of any one of SEQ ID NOs shown in Table 1 , wherein said nucleotide sequence encodes a polypeptide functionally 

5 equivalent to a polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypeptide 
encoded by the polynucleotide according to any one of (a) to (d) ; 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence of 
(a); and 

10 (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence of 

(a). 

[0017] The present invention also relates to a polypeptide encoded by the above-mentioned polynucleotide or a 
partial peptide thereof, an antibody binding to the polypeptide orthe peptide, and a method for immunologically assaying 
15 the polypeptide orthe peptide, which comprises the steps of contacting the polypeptide orthe peptide with the antibody, 
and observing the binding between the two. 

[0018] Furthermore, the present invention features a vectorcomprisingthe above-mentioned polynucleotide, atrans- 
formant carrying the polynucleotide or the vector, a transformant carrying the polynucleotide or the vector in an ex- 
pressible manner, and a method for producing the polypeptide or the peptide, which comprises the steps of culturing 

20 the transformant and recovering an expression product. 

[0019] Another feature of the present invention is an oligonucleotide comprising at least 1 5 nucleotides, said oligo- 
nucleotide comprising a nucleotide sequence complementary to the nucleotide sequence of any one of SEQ ID NOs: 
1 to 1639 or to a complementary strand thereof. This oligonucleotide can be used as a primer for synthesizing the 
above-mentioned polynucleotide or used as a probe for detecting the polynucleotide. The present invention includes 

25 an antisense polynucleotide against the polynucleotide or a part thereof, and a method for detecting the polynucleotide, 
which comprises the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide. 

30 

[0020] Still another feature of the present invention is a database of polynucleotides and/or polypeptides, said da- 
tabase comprising information on at least one of the nucleotide sequences of SEQ ID NOs: 1 to 1639 and/or on at 
least one of the amino acid sequences of SEQ ID NOs: 1640 to 3278. 

[0021] Herein, "polynucleotide" is defined as a molecule, such as DNA and RNA, in which multiple nucleotides are 

35 polymerized. There are no limitations on the number of the polymerized nucleotides. In case that the polymer contains 
relatively low number of nucleotides, it is also described as an "oligonucleotide", which is included in the "polynucleotide" 
of the present invention. The polynucleotide or the oligonucleotide of the present invention can bea natural or chemically 
synthesized product. Alternatively, it can be synthesized using a template polynucleotide by an enzymatic reaction 
such as PGR. Furthermore, the polynucleotide of the present invention may be modified chemically. Moreover, not only 

40 a single-strand polynucleotide but also a double-strand polynucleotide is included in the present invention. In this 
specification, especially In claims, when the polynucleotide is described merely as "polynucleotide", it means not only 
a single-strand polynucleotide but also a double-strand polynucleotide. When it means double-strand polynucleotide, 
the nucleotide sequence of only one chain is indicated. However, based on the nucleotide sequence of a sense chain, 
the nucleotide sequence of the complementary strand thereof is essentially determined. 

45 [0022] As used herein, an "isolated polynucleotide" is a polynucleotide the structure of which is not identical to that 
of any naturally occurring polynucleotide or to that of any fragment of a naturally occurring genomic polynucleotide 
spanning more than three separate genes. The term therefore includes, for example, (a) a DNA which has the sequence 
of part of a naturally occurring genomic DNA molecule in the genome of the organism in which it naturally occurs; (b) 
a polynucleotide incorporated into a vector or into the genomic DNA of a prokaryote or eukaryote in a manner such 

50 that the resulting molecule is not identical to any naturally occurring vector or genomic DNA; (c) a separate molecule 
such as a cDNA, a genomic fragment, a fragment produced by polymerase chain reaction (PGR), or a restriction 
fragment; and (d) a recombinant nucleotide sequence that is part of a hybrid gene, i.e., a gene encoding a fusion 
polypeptide. Specifically excluded from this definition are polynucleotides of DNA molecules present in mixtures of 
different (i) DNA molecules, (il) transfected cells, or (Hi) cell clones; e.g., as these occur in a DNA library such as a 

55 cDNA or genomic DNA library. 

[0023] The term "substantially pure" as used'herein in reference to a given protein or polypeptide means that the 
protein or polypeptide is substantially free from other biological macromolecules. For example, the substantially pure 
protein or polypeptide is at least 75%, 80%, 85%, 95%, or 99% pure by dry weight. Purity can be measured by any 
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appropriate standard method known in the art, for example, by column chromatography, polyacrylamlde gel electro- 
phoresis, or HPLC analysis. 

[0024] All the cDNAs provided by the present Invention are full-length cDNAs. The "full-length cDNA" herein means 
that the cDNA contains the ATGcodon, which is the start point of translation therein. The untranslated regions upstream 
and downstream of the protein-coding region, both of which are naturally contained in natural mRNAs, are not indis- 
pensable. It is preferable that the full-length cDNAs of the present invention contain the stop codon. 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0025] Figure 1 shows the restriction map of the vector pME1 8SFL3. 
DETAILED DESCRIPTION OF THE INVENTION 



[0026] All the clones (1 639 clones) of the present invention are novel and encodethe full-length polypeptides. Further, 
all the clones are cDNAs with the high fullness ratio, which were obtained by oligo-capping method, and also clones 
which are not identical to any of l<nown human mRNAs (namely, novel clones) selected by searching, for the 5'-end 
sequences, mRNA sequences with the annotation of "complete cds" in the GenBani< and UnlGene databases by using 
the BLAST homology search [S. F Altschui, W. GIsh, W. Miller, E. W. IVIyers & D. J. Lipman, J. iVIol. Bloi., 21 5: 403-41 0 
(1990); W. Gish & D. J. States, Nature Genet., 3: 266-272 (1993)]; they are also clones that were assumed to have 
higher fullness ratio among the members in the cluster formed by assembling. IVIost of the clones assessed to have 
high fullness ratio in the cluster had the nucleotide sequences longer in the 5'-end direction. 
[0027] All the full-length cDNAs of the present invention can be synthesized by a method such as PGR (Current 
protocols in iVIolecular Biology edit. Ausubel et al. (1987) Publish. John Wiley & Sons Section 6.1-6.4) using primer 
sets designed based on the 5'-end and 3'-end sequences or using primer sets of primers designed based on the 5'- 
end sequences and a primer of oligo dT sequence corresponding to poly A sequence. Table 1 contains the clone names 
of full-length cDNA of 1 639 clones of the present invention, SEQ ID NOs of the full-length nucleotide sequences, CDS 
portions deduced from the full-length nucleotide sequences, and SEQ ID NOs of the translated amino acids. The 
positions of CDS are shown according to the rule of "DDBJ/EMBL/GenBank Feature Table Definition" (http://www.ncbi, 
nim.nih gov/collab/FT/index.htmi). The start position number corresponds to the first letter of "ATG" that Is the nucle- 
otide triplet encoding methionine; the termination position number corresponds to the third letter of the stop codon, 
These are indicated being flanked with the mark However, with respect to the clones having no stop codon, the 
termination position Is indicated by the mark ">" according to the above rule. 



Table 1 



SEQ ID NO, of nucleotide 



3NB691 00001 80 
3NB691 0000850 
3NB6910001160 
3NB691 0001 290 
3NB691 0001 730 
3NB6920000290 
3NB6920002810 
3NB6920003300 
3NB6920005450 
3NB6920009120 
3NB6920010020 
3NB6920010220 
3NB6920013490 
3NB6920014330 
3NB6920014710 
3NB6920015110 
3NB6920015280 
3NB6920015570 
3NB6920016370 



Position of CDS 

505.. 1434 
132..836 
85..702 
194.. 706 
166..>1755 
991 ..1335 
8. 1375 
128..901 
21. .809 
247.. 1983 
59. .913 
108. .1289 
402.. 1214 
674.. 151 6 
369.. >1 990 
306.. 11 63 
162.. 1847 
407.. 1903 
25.. 348 



SEQ ID NO. Of a 
sequence 
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Table 1 (continued) 



"3"NB69200f7T90"" 
ADRGL1 0000020 
ADRGL1 00001 80 
ADRGL1 0000650 
ADRGL1 0001 600 
ADRGL1 0001 650 
ADRGL1 0001 820 
ADRGL20000740 
ADRGL20003230 
ADRGL20004280 
ASTR01 00001 80 
ASTR020000950 
ASTR0200041 70 
ASTR020004800 
BGG1 11 0002850 
BGGI120001610 
BGG1 120005330 
BGG1 120005440 
BGG1 120006840 
BGG1 120006930 
BGGI120010970 
BGGI120017140 
BNGH41 0000030 
BNGH410000130 
BNGH41 00001 70 
BNGH41 0000290 
BNGH41 0000330 
BNGH41 0000340 
BNGH41 0000390 
BNGH41 0000800 
BNGH41 0001 040 
BNGH410001180 
BNGH410001370 
BNGH410001530 
BNGH41 0001 770 
BNGH41 0001 900 
BNGH41 0001 980 
BNGH420004740 
BNGH420005320 
BRACE1 0000200 
BRACE1 0000420 
BRACE1 0000700 
BRACE1 0000730 
BRACE1 0000930 
BRACE10001150 
BRACE1 0001590 
BRACE1 0001 660 
BRACE1 0001 690 
BRACE1 0001 870 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



T5"rT06"8""" 
7.. 1773 
1742. .21 10 
425.. 1072 
10.. 1407 
107.. 1387 
2165 .2467 
114.. 1349 
118..516 
169..477 
158..2716 
110.. 1315 
1121. .1732 
1177..1539 
1119.. 1460 
454.. 11 85 
79. .1548 
453.. 1661 
140.. 1303 
38. 1627 
264.2078 
140. .1009 
780.. 1967 
1087, 1539 
1466, .1801 
51. ,1661 
2453 ,2782 
539 ,1477 
134„1135 
232 ,,549 
146„1651 
166„>2624 
61. .1740 
1368 .1760 
82. 1773 
113..>2458 
97.. 1476 
1171. .1473 
354.. 1937 
959. 1267 
214.1332 
181 ,,900 
168„698 
45, .1388 
76. .624 
225..767 
777.. 1325 
557..889 
85.. 1704 



SEQ ID NO, of amino acid 

sequence 
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Table 1 (continued) 



"BRACE2d000770 " 
BRACE20001000 
BRACE20001410 
BRACE20002800 
BRACE20003320 
BRACE20004210 
BRACE20005050 
BRACE20005250 
BRACE20005450 
BRACE20005650 
BRACE20005770 
BRACE20006980 
BRACE20007180 
BRACE20008850 
BRACE20009880 
BRACE20010650 
BRACE20010700 
BRACE20011170 
BRACE20011430 
BRACE20011880 
BRACE20013400 
BRACE20013520 
BRACE20013740 
BRACE20013750 
BRACE20014230 
BRACE20014530 
BRACE20014550 
BRACE20014770 
BRACE20014920 
BRACE20015080 
BRACE20015430 
BRACE20016730 
BRACE20016920 
BRACE20017370 
BRACE20018550 
BRACE200 18590 
BRACE20018650 
BRACE20018980 
BRACE20019440 
BRACE20020500 
BRAGE20020910 
BRACE20021510 
BRACE20021760 
BRACE20022020 
BRACE20022270 
BRACE20024090 
BRACE20024310 
BRACE20024680 
BRACE20024780 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



T62".605"""" 
1122. .1430 
229. .975 
35. .1663 
471. .2063 
1562. .2716 
711. .1013 
617.. 1144 
184. .498 
272.. 841 
514.. 816 
85. .1446 
1263. .1679 
596..>1809 
1761. 2069 
1257.. 1922 
1680.. >1989 
87.. 410 
1550. .2041 
742.. 11 04 
1495. .1890 
677. .1048 
1694.2317 
821. .1192 
1523, .1978 
1747. 2205 
921. .1838 
351. .1286 
1306. 1638 
39.. 545 
235..579 
445..828 
96 .569 
1612 .2082 
152..2008 
902.. 1408 
1143. .1529 
21 78. 2558 
225. 1205 
605.. 1372 
283.. 11 76 
1003.. 1653 
393.. 11 48 
71 7.. 1727 
3..353 
165..845 
2292.261 8 
65.. 1129 
297.. 11 54 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"BRACE2d024950 " 
BRACE20025900 
BRACE20026350 
BRACE20026850 
BRACE20027360 
BRACE20027520 
BRACE20027550 
BRACE20027720 
BRACE20027920 
BRACE20027960 
BRACE20028120 
BRACE20028600 
BRACE20028610 
BRACE20028960 
BRACE20030780 
BRACE20031100 
BRACE20032850 
BRACE20033190 
BRACE20033980 
BRACE20034310 
BRACE20034490 
BRACE20035160 
BRACE20035270 
BRACE20035390 
BRACE20035940 
BRACE20071380 
BRACE20071530 
BRACE20071740 
BRACE20071970 
BRACE20072010 
BRACE20072320 
BRACE20072810 
BRACE20074010 
BRACE20074470 
BRACE20075020 
BRACE20075270 
BRACE20075380 
BRACE20075630 
BRACE20076210 
BRACE20076410 
BRACE20076460 
BRACE20076630 
BRACE20076850 
BRACE20077080 
BRACE20077270 
BRACE20077610 
BRACE20077640 
BRACE20077670 
BRACE20077680 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



636.. 941 
106.. 753 
113.. 1063 
1147..1680 
613..1215 
168.578 
169.. 777 
105.. 1496 
1731. .2132 
408.. 91 7 
784.. 1593 
774.. 11 96 
63. .1472 
236.637 
214..2565 
698.. 1327 
202..531 
811. .1215 
526.. 1461 
1316. 1699 
492..896 
572.. 1099 
891 ..1211 



118., 1800 
1584. 1895 
422„1114 
266„583 
858,. 11 93 
92, .1246 
742„2109 
2248. 2553 
1740,. 2285 
1455, .1889 
996.. 1445 
850. 1509 
78, .1040 
1368. 1709 
634..996 
598.. 984 
11 9.. 853 
95.. 445 
239..553 
595..900 
317..649 
201 3.. 2390 
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Table 1 (continued) 



SEQ ID NO, of nucleotide 

sequence 



"BRACE2d077840 " 
BRACE20077980 
BRACE20078680 
BRACE20078820 
BRACE20079020 
BRACE20079530 
BRACE20080970 
BRACE20081140 
BRACE20083800 
BRACE20083850 
BRACE20084430 
BRACE20084800 
BRACE20084880 
BRACE20086530 
BRACE20086550 
BRACE20087080 
BRACE20087540 
BRACE20088570 
BRACE20089600 



BRACE20090140 
BRACE20091880 
BRACE20092120 
BRACE20092740 
BRACE20092750 
BRACE20093070 
BRACE20093110 
BRACE20093610 
BRACE20094370 
BRACE20095170 
BRAWH1 0000010 
BRAWH1 0000020 
BRAWH1 0000070 
BRAWH1 0000370 
BRAWH1 0000940 
BRAWH10001300 
BRAWH1 0001 620 
BRAWH1 0001 640 
BRAWH1 0001 680 
BRAWH1 0001 740 
BRAWH1 0001 800 
BRAWH20000340 
BRAWH20000480 
BRAWH20000930 
BRAWH20001090 
BRAWH20001770 
BRAWH20002480 
BRAWH20003230 
BRAWH20004430 



Position of CDS 



T8".524"' 
855.. 1604 
1144. 
2..580 
1178. .1681 
8..331 
1025. .1348 



SEQ ID NO. of amino acid 

sequence 



753 



14. 463 
71. .1201 
183.. 1046 
1035. .1649 
57.. 845 
1234. 1617 
405..914 
219..632 
31.. 342 
1013.. 1459 
42. 365 
121 ..630 



160.. 1959 
255..905 
1450. 1827 
562.. 11 58 
1581, .1940 
1337,.>1783 
826„1149 
1657, 1959 
1241, ,2050 
)„>1920 
560..2242 
161 .787 
203.2020 
256..2025 
109.. 1128 
185.. 613 
229.. 987 
277. 1383 
182..601 
2214..2537 
218..1117 
1521. .1904 
356..2014 
69.. 476 
1385. 1777 
232..780 
222..2186 



9 
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Table 1 (continued) 



"BRAWH200d4760" 
BRAWH20005030 
BRAWH20005220 
BRAWH20005540 
BRAWH20006330 
BRAWH20006510 
BRAWH20006860 
BRAWH20006970 
BRAWH20008660 
BRAWH20008920 
BRAWH20009010 
BRAWH20009440 
BRAWH20009840 
BRAWH20011030 
BRAWH20011290 
BRAWH20011410 
BRAWH20011660 
BRAWH20012030 
BRAWH20014180 
BRAWH20014380 
BRAWH20014610 
BRAWH20014840 
BRAWH20015030 
BRAWH20036890 
BRAWH20036930 
BRAWH20038320 
BRAWH20040950 
BRAWH20047310 
BRAWH20052250 
BRAWH20059980 
BRAWH20060440 
BRAWH20064500 
BRAWH20064930 
BRAWH20066220 
BRAWH20069600 
BRAWH20069890 
BRAWH20074060 
BRAWH20076050 
BRAWH20087060 
BRAWH20089030 
BRAWH20089560 
BRAWH20092270 
BRAWH20092610 
BRAWH20093600 
BRAWH20094850 
CD34C20000510 
CTONG20003030 
CTONG20005890 
CTONG20007710 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



22" .564 
1431. .1754 
98. .700 
416..811 
854.. 1741 
438.. 11 63 
599..2179 
1178..1579 
505.. 924 
21 79. 2529 
1139. .1540 
422.. 1603 
826.. 1764 



SEQ ID NO. c 
sequence 



217 
218 



223 
224 



232 
233 



..218f 



71. .1444 
251. .71 8 
104.. 1411 
55. .684 
5.. 751 
795.. 11 30 
743.. 121 6 
404. .2227 
861. .1232 
31 9. .864 
28. 3078 
998.. 1306 
1867 .2205 
47.547 
30.398 
161 ..1900 
65.. 2083 
209.. 1675 
403.. 1908 
1395 .>2080 
1482. 1976 
444.. 1226 
1028. ,1429 
45. .1469 
1160. .2053 
346. 825 
87.2906 
223.. 1062 
258..791 
21.. 644 
1213.. 1536 
7.. 1380 
2228.2641 
294..3017 
1088.. 1468 



10 
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Table 1 (continued) 



CT6NG20d08270" 

CTONG20011390 

CTONG20013200 

CTONG20013660 

CTONG20015330 

CTONG20018200 

CTONG20019110 

CTONG20019550 

CTONG20020730 

CTONG20021430 

CTONG20024180 

CTONG20024530 

CTONG20025580 

CTONG20027210 

CTONG20028030 

CTONG20028160 

GTONG20028200 

CTONG20029650 

CTONG20037820 

CTONG20047160 

GTONG20055530 

GTONG20064490 

D30ST20001840 

DFNES20002120 

DFNES20002680 

DFNES20002920 

DFNES20003350 

DFNES20004320 

FCBBF1 0005980 

FCBBF1 00061 80 

FCBBF1 0006750 

FCBBF1 0006860 

FCBBF1 0006870 

FCBBF1 0006910 

FCBBF1 0007320 

FCBBF1 0007600 

FCBBF20000940 

FCBBF20001050 

FCBBF20001950 

FCBBF20002320 

FCBBF20002760 

FCBBF20005760 

FCBBF20005910 

FCBBF20006770 

FCBBF20007330 

FCBBF20008080 

FCBBF20008150 

FCBBF20009400 

FGBBF20009510 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



279 
280 



295 
296 



50.3337 
123.. 1823 1823 
3..1571 
87.527 
42..>3217 
511. .1455 
162..>3684 
20.. 1342 
74..>2745 
641. .2452 
618.. 986 
1281. .2177 
366..>2763 
837. 1280 
301. .2799 
148..>3543 
118..>2551 
332..895 
113.. 1021 
63..>2305 
1533, .1859 
77.. 1429 
1263.. 1940 
505..2571 
52..639 
33. 1007 
1517. 1849 
375..2141 
1161. .1463 
106.. 1023 
521 ..865 
410..1831 
111. .482 
1236.. 1850 
68. 523 
249..2147 
29. 421 
1565, .1888 
921. .2003 
221. .895 
693.. 1073 
29..>2161 
265..600 
745.. 1077 
180..821 
641 ..1762 
360..749 
21 8.. 1333 



11 
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Table 1 (continued) 



"FCBBF200121T0"" 
FCBBF20012990 
FCBBF20014800 
FCBBF20015380 
FCBBF20016720 
FCBBF20017180 
FCBBF20017200 
FCBBF40002820 
FCBBF50002610 
FEBRA20000350 
FEBRA20000530 
FEBRA20001050 
FEBRA20001290 
FEBRA20003110 
FEBRA200033Q0 
FEBRA20003770 
FEBRA20003780 
FEBRA20003910 
FEBRA20003970 
FEBRA20003990 
FEBRA20004040 
FEBRA20004150 
FEBRA20004520 
FEBRA20004540 
FEBRA20004910 
FEBRA20005360 
FEBRA20006560 
FEBRA20006800 
FEBRA20006900 
FEBRA20007330 
FEBRA20007400 
FEBRA20007570 
FEBRA20007710 
FEBRA20007720 
FEBRA20007870 
FEBRA20008090 
FEBRA20008560 
FEBRA20008740 
FEBRA20008800 
FEBRA20008810 
FEBRA20009010 
FEBRA20009590 
FEBRA20009720 
FEBRA20010930 
FEBRA20011330 
FEBRA20011460 
FEBRA20011970 
FEBRA20012270 
FEBRA20012450 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



T36"T77"6""" 
758.. 11 68 
146.. 1957 
302.. 1246 
51.. 488 
236.. 598 
2306 .2767 
207.. 1247 
126. .1634 
724.. 1764 
359.. 2221 
496.. 2355 
872.. 11 92 
386. 964 
1504 .1881 
398..2464 
454..873 
1728..2057 
162.. 1469 
845..2404 
21.. 449 
836..2494 
11 3.. 442 
934..2484 
1417.. 1926 
254.. 1453 
34.615 
303. 1040 
1268, .1669 
1486, .2013 
32, .1123 
222.. 11 93 
1059 .1361 
267.689 
1174..1788 
1319, .1621 
1500, .1991 
2051. 2368 
38. 703 
377.. 1495 
243..638 
1345.. 1677 
530..2140 
239.. 1249 
1795.. 2322 
221 9.. 2602 
1672. 1977 
1933. 2238 
519..2678 



SEQ ID NO. c 
sequence 
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Table 1 (continued) 



"FEBRA2W12940"" 
FEBRA20013510 
FEBRA20014870 
FEBRA20014920 
FEBRA20015840 
FEBRA20015900 
FEBRA20015910 
FEBRA20017060 
FEBRA20017150 
FEBRA20017900 
FEBRA20019890 
FEBRA20020860 
FEBRA20021910 
FEBRA20021940 
FEBRA20024290 
FEBRA20024420 
FEBRA20025250 
FEBRA20027270 
FEBRA20027830 
FEBRA20028820 
FEBRA20028970 
FEBRA20029080 
FEBRA20030540 
FEBRA20031550 
FEBRA20033080 
FEBRA20034290 
FEBRA20037070 
FEBRA20041100 
FEBRA20041910 
FEBRA20042240 
FEBRA20042370 
FEBRA20042930 
FEBRA20043250 
FEBRA20043290 
FEBRA20044120 
FEBRA20044430 
FEBRA20044900 
FEBRA20045920 
FEBRA20048180 
FEBRA20050140 
FEBRA20050790 
FEBRA20052160 
FEBRA20053770 
FEBRA20053800 
FEBRA20054270 
FEBRA20057260 
FEBRA20057520 
FEBRA20057780 
FEBRA20057880 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



■69:.575 
990..1313 
1567.. 1884 
77..2815 
702.. 1853 
1535. 1882 
246.617 
730.. 1257 
391 ..2799 
72. 647 
497..2239 
281 ..673 
1455.. 1829 
17.493 
35. 1933 
991 ..1614 
251 ..21 64 
10.930 
293..610 
1337.. 1678 
824.. 1303 
90. 764 
292..993 
2000. 2365 
399.. 749 
348.. 854 
1830 .>2246 
198.. 1010 
39.425 
1373.. 1714 
51.. 938 
2321..>2652 
394..>2294 
57.2984 
928.. 1263 
192..539 
78. .1763 
344.. 1438 
86. 493 
727.2325 
780. 1295 
1055.. 1411 
790.. 11 97 
213..521 
1711.. 2058 
8.. 1789 
13. 480 
150..641 
8..>3165 



SEQ ID NO. of amino acid 

sequence 



408 
409 
410 
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Table 1 (continued) 



"FEBRA2"005998d" " 
FEBRA20060920 
FEBRA20061500 
FEBRA20062700 
FEBRA20063150 
FEBRA20063540 
FEBRA20064760 
FEBRA20066270 
FEBRA20066670 
FEBRA20067360 
FEBRA20067930 
FEBRA20068730 
FEBRA20069420 
FEBRA20070170 
FEBRA200720Q0 
FEBRA20072800 
FEBRA20074140 
FEBRA20074580 
FEBRA20075510 
FEBRA20075660 
FEBRA20076220 
HCASM 10000210 
HCASM 10000610 
HCASM 10001 150 
HCASM20002020 
HCASM20002140 
HCASM20003070 
HCASM20005340 
HCASM20005360 
HEART20000350 
HEART20000990 
HEART20003090 
HEART20004110 
HEART20004480 
HEART20004920 
HEART20005060 
HEART20005200 
HEART20005680 
HHDPC20000550 
HHDPC20000950 
HHDPC20001150 
HHDPC20001490 
HHDPC20003150 
HHDPC20004550 
HHDPC20004560 
HHDPC20004620 
HLUNG1 0000240 
HLUNG1 0000300 
HLUNG1 0000370 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



T160.:2"0"17"" 
26..661 
77.. 460 
32..778 
236..538 
1402.. 1737 
340..2076 
278..691 
1933. 2496 
160.. 1713 
1332. .1973 
171. .2051 
231. .1439 
88.921 
47.2065 
1850. 2335 
57..371 
91.. 420 
181 ..606 
1960..2298 
147..2525 
384.. 11 54 
317..871 
473..868 
152..469 
408.. 11 36 
48 .1865 
1003, .1368 
2211.. 261 8 
514.. 1290 
1341. .1670 
1133. .1657 
39 .1415 
863.1171 
159..851 
95, .1495 
938.. 1375 
783.. 11 39 
55. 1320 
338.. 1462 
326..925 
337..2265 
506..1711 
197..2065 
174..>2424 
17.. 865 
1439.. 1762 
205..555 
14.. 1930 



SEQ ID NO. of amino acid 

sequence 



439 
440 



457 
458 
459 
460 
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Table 1 (continued) 



'hlungToooomo " 

HLUNG1 0000760 
HLUNG1 0000990 
HLUNG1 0001 050 
HLUNG1G001100 
HLUNG20000680 
HLUNG20001160 
HLUNG20001250 
HLUNG2G001420 
HLUNG20001760 
HLUNG20002550 
HLUNG20003140 
HLUNG20004120 
HLUNG20004800 
HLUNG2G005010 
HSYRA1 0001 190 
HSYRA1 0001 370 
HSYRA1 0001 480 
HSYRA1 0001 680 
HSYRA1 0001 780 
HSYRA20001350 
HSYRA20002480 
HSYRA20002530 
HSYRA20003470 
HSYRA20005100 
HSYRA20006050 
HSYRA20006290 
HSYRA20006400 
HSYRA20007600 
HSYRA20008280 
HSYRA20011030 
HSYRA20011530 
HSYRA20013320 
HSYRA20014200 
HSYRA20014760 
HSYRA20015740 
HSYRA20015800 
HSYRA20016210 
HSYRA20016310 
IMR321 0000440 
IMR321 0000740 
IMR321 0000750 
IMR3210001580 
IMR3210001650 
I MR321 0002420 
I MR32 10002660 
IMR3220002230 
IMR3220003020 
IMR3220006090 



SEQ ID NO. of nucleotide 
sequence 



Position of CDS 



T44:.T514""" 
80. 1246 
67.. 2370 
1081. .1614 
302.. 703 
187..1527 
391. .1434 
360.899 
173. 1600 
6..524 
1101. .1865 



37.966 
159..1712 
194.. 1903 
27..614 
59..>2245 
598.. 1023 
218..2371 
670..972 
214..780 
1134..1580 
138..1379 
66.. 1199 
222.. 11 60 
648.. 1052 
389.. 1207 
1492. 1821 
572..>2236 
653.. 955 
3.. 956 
326. 1237 
46 .783 
11 7.. 947 
1100. .1435 
81 ,470 
317. .1105 
313.. 1560 
1215, .1616 
722,, 1054 
125,, 1249 
903,, 1391 
194,, 1255 
38.. 1423 
25.,945 
403,, 1425 
8„442 



SEQ ID NO, of amino acid 

sequence 



503 
504 



507 
508 
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Table 1 (continued) 



1MR3226007420" 
IMR3220007750 
IMR3220007910 
IMR3220008380 
IMR3220008590 
IMR3220008630 
IMR3220009190 
IMR3220009350 
IMR3220009530 
IMR3220009730 
IMR3220009840 
IMR3220011850 
IMR3220012180 
IMR3220013170 
IMR3220Q13320 
IMR3220014350 
IMR3220014910 
IMR3220016000 
IMR3220017240 
KIDNE1 0000080 
KIDNE1 0000280 
KIDNE1 0000500 
KIDNE10001040 
KIDNE10001430 
KIDNE10001450 
KIDNE10001520 
KIDNE20000410 
KIDNE20000510 
KIDNE20000700 
KIDNE20000850 
KIDNE20001670 
KIDNE20001920 
KIDNE20002440 
KIDNE20002450 
KIDNE20002660 
KIDNE20003150 
KIDNE20003300 
KIDNE20003490 
KIDNE20003750 
KIDNE20004030 
KIDNE20004220 
KIDNE20004970 
KIDNE20005130 
KIDNE20005170 
KIDNE20005190 
KIDNE20005740 
KIDNE20O3185 
KIDNE20033050 
KIDNE20033350 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



T5:.6T4 
197.. >1 670 
934. .1515 
228.. 11 42 
756.. 1061 
94.1197 
31. .1293 
20. .421 
2..394 
102..>2141 
246..554 
271 ..1026 
76. 1521 
393. 1028 
22.1 140 
807..2027 
79..639 
8..532 
1114.. 1458 
346.. 1233 
1864.2181 
662.. 11 08 
115.. 1656 
560.. 1051 
7..483 
74.712 
712. .1215 
188. 1717 
843..2135 
19, .1071 
727.. 1725 
1648, .2070 
163. .468 
1273, .1653 
1929. 2249 
926.. 1306 
1205, .2230 
219.. 1523 
732. 1568 
194..3142 
1699.221 7 
274.. 1479 
404.. 141 7 
57.. 944 
530.. 1045 
32.. 1309 
1050..>1998 
121 ..1719 
1999.. 2304 



SEQ ID NO. of amino acid 

sequence 



556 
557 
558 
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Table 1 (continued) 



"KIDNE20033570" 
KIDNE20033730 
KIDNE20033770 
KIDNE20037520 
KIDNE20039410 
KIDNE20039940 
KIDNE20040340 
KIDNE20040540 
KIDNE20040840 
KIDNE20042620 
KIDNE20042940 
KIDNE20042950 
KIDNE20043440 
KIDNE20044110 
KIDNE20045200 
KIDNE20045340 
KIDNE20045790 
KIDNE20046810 
KIDNE20048280 
KIDNE20048640 
KIDNE20048790 
KIDNE20049810 
KIDNE20050420 
KIDNE20052960 
KIDNE20053360 
KIDNE20054000 
KIDNE20054770 
KIDNE20056290 
KIDNE20056760 
KIDNE20059080 
KIDNE20059370 
KIDNE20060140 
KIDNE20060300 
KIDNE20060530 
KIDNE20060620 
KIDNE20061490 
KIDNE20062480 
KIDNE20062990 
KIDNE20063530 
KIDNE20063760 
KIDNE20066520 
KIDNE20067600 
KIDNE20067750 
KIDNE20068800 
KIDNE20070050 
KIDNE20070770 
KIDNE20071860 
KIDNE20073280 
KIDNE20073520 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



408..2366 
1256. .1603 
1032.. 1637 
263..928 
133.. 11 34 
460. 768 
5.. 1276 
5..>3343 
1312. .1716 
213..611 
72. 476 
49..2136 
167..1708 
1455. .1955 
44.919 
1561. 2085 
398.. 1066 
124..2010 
1437. 1775 
348.. 701 
909. .21 08 
183..2351 
1105. .1461 
1054,. 1470 
1246, .1596 
99, .1511 
398.. 1306 
375.. 1493 
44..>2546 
1683..2018 
52.. 1572 
42 .530 
136.2208 
31. .687 
824.. 1327 
649. .963 
72, .1334 
710. .1105 
1332, .1724 
11 2., 435 
677.. 1795 
54. 2228 
1048. .1473 
1672.. 2001 
610.. 1845 
1662. 1973 
24.1898 
332..2140 
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Table 1 (continued) 



"KIDNE20073560" " 
KIDNE20074220 
KIDNE20075690 
KIDNE20078100 
KIDNE20078110 
LIVER1 0000580 
LIVER1 0000670 
LIVER1 0000790 
LIVER1 0000990 
LIVER1 0001 040 
LIVER10001110 
LIVER1 0001 750 
LIVER1 0002300 
LIVER1 0002780 
LIVER1 0003030 
LIVER1 0004330 
LIVER1 0005420 
LIVER20000330 
LIVER20000370 
LIVER20004160 
LIVER20004460 
LIVER20005150 
MAMGL1 0000320 
MAMGL1 0000350 
MAMGL1 0000560 
MAMGL1 0001 780 
MAMGL1 0001 820 
MAMGL1 0001 840 
MESAN1 0000350 
MESAN1 0001 010 
MESAN10001470 
MESAN1 0001 800 
MESAN20000920 
MESAN20001490 
MESAN20002670 
MESAN20002910 
MESAN20003370 
MESAN20005010 
NB9N41 0000470 
NB9N410001210 
NB9N41 0001 350 
NB9N41 0001 460 
NB9N420000420 
NB9N420001040 
NB9N420004950 
NESOP 10000870 
NHNPC1 0000840 
NHNPC10001010 
NHNPC1 0001 240 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



'171.497 
1496. 1879 



SEQ ID NO. of amino acid 

sequence 



231..! 



714..1373 
166.. 1482 
1704.. 2 108 
33 .2063 
98.. 1072 
1451. 2065 
710.. 1759 
549.. 878 
1737.2156 
861 ..1565 
2235 .2699 
2310.2621 
22..2130 
131 ..2035 
135..839 
1158..2138 
1625.. 1966 
882. 1307 
1453 .2262 
71. .1792 
307.. 2721 
78. 623 
1019, .1618 
92. .1405 
636., 1181 
362.. 1843 
54. 2279 
1571, .1906 
482,. 1900 
546..2213 
652..>2706 
1092. 1535 
1625 ,2269 
198..521 
183.. 1550 
329..>1714 
74.1786 
194..619 
678.. 11 36 
596.. 928 
12.. 2030 
1929.. 2648 
793.. 1554 
720..>1934 
119.541 
1513.. 1860 
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Table 1 (continued) 



Clone name 



■NHNPC20002060" 
NHNPC20002120 

NT2NE1 0000040 
NT2NE1 00001 40 
NT2NE1 00001 80 
NT2NE1 0000230 
NT2NE1 0000630 
NT2NE1 0000730 
NT2NE1 0000830 
NT2NE1 0001 200 
NT2NE1 0001 630 
NT2NE1 0001 850 
NT2NE20000380 
NT2NE20000560 
NT2NE20000640 
NT2NE20001740 
NT2NE20002140 
NT2NE20002590 
NT2NE20002990 
NT2NE20003270 
NT2NE20003690 
NT2NE20003840 
NT2NE20003920 
NT2NE20004550 
NT2NE20004700 
NT2NE20005170 
NT2NE20005360 
NT2NE20005500 
NT2NE20005860 
NT2NE20006360 
NT2NE20006580 
NT2NE20007060 
NT2NE20007630 
NT2NE20007870 
NT2NE20008020 
NT2NE20008090 
NT2NE20009800 
NT2NE20011560 
NT2NE20012470 
NT2NE20013240 
NT2NE20013370 
NT2NE20013640 
NT2NE20013720 
NT2NE20014030 
NT2NE20014280 
NT2NE20014350 
NT2NE20015300 
NT2NE20016230 
NT2NE20016260 



SEQ ID NO. of nucleotide 
sequence 



Position of CDS 



Tl3rT3"51 
98..1195 
274.. 61 8 
232..>2244 
131..1189 
172.. 483 
175.. 1317 
585.. 1364 
556.. 11 79 
431. .874 
600..1013 
23.. 1960 
408..962 
93..>1858 
731 ..1063 

I. .1638 
56.. 2053 
233..961 
117.. 1565 
127..>2256 
534..893 
91. 2403 
372., 749 
31,. 852 
1569, 2048 
217„990 
1412, .1717 
574.. 1605 
289..996 
1383..>2954 
937..2073 
640.. 1053 

II. . 679 
508.. 1092 
420.. 776 
328.. 1995 
547.. 11 64 
78. .1037 
204. .61 4 
747. 1079 
1229. 2533 
1854.. 2264 
159.. 695 
1466. 2263 
439.. 903 
684.. 11 90 
120..>2517 
713..1111 
528.. 1721 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"NT2NE20016340" 
NT2NE20016480 
NT2NE20016660 
NT2NE20016970 
NT2NE20034080 
NT2NE20035690 
NT2NE20044900 
NT2NE20047160 
NT2NE20053710 
NT2NE20054410 
NT2NE20055170 
NT2NE20057200 
NT2R1 100001 60 
NT2R1 10000270 
NT2R1 10000480 
NT2R1 10001 640 
NT2RI20000640 
NT2RI20002700 
NT2RI20002820 
NT2RI20002940 
NT2RI20003410 
NT2RI20004120 
NT2RI20004210 
NT2RI20005970 
NT2RI20006690 
NT2RI20006710 
NT2RI20006850 
NT2RI20007380 
NT2RI20008650 
NT2RI20009740 
NT2RI20010100 
NT2RI20010830 
NT2RI20010910 
NT2RI20012350 
NT2RI20012440 
NT2RI20013420 
NT2RI20013850 
NT2RI20014090 
NT2RI20014100 
NT2RI20014490 
NT2RI20014500 
NT2RI20015190 
NT2RI20015400 
NT2RI20015950 
NT2RI20016210 
NT2RI20016570 
NT2RI20017260 
NT2RI20018460 
NT2RI20018660 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



349. .91 8 
48.566 
119.. 1606 
969..>2204 
431. .1039 
658.. 1707 
406.. 1020 
975.. 171 5 
50. 955 
205.. 879 
290.. 1270 
330.. 809 
502. 1068 
4..>2060 
1391. .1894 
849.. 1253 
195.. 1781 
63.. 467 
148.. 1878 
23.. 1600 
477.. 1367 
309.. 1562 
206..>2507 
884.. 1654 
550. 2235 
843.. 1208 
692.. 1045 
287. .595 
150.. 1727 
295.. 2325 
361 ..1254 
969.. 1331 
1026, .1367 
6..506 
838.. 1497 
57.. 1739 
1280, .1612 
244.. 2247 
451. .2331 
388..1671 
454..2277 
575.. 1078 
571 ..1113 
238.. 1026 
1143..1703 
64..>2603 
245.. 1672 
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Table 1 (continued) 



"NT2RI 26020220 " 
NT2RI20020410 
NT2RI20021520 
NT2RI20022430 
NT2RI20022520 
NT2RI20022700 
NT2RI20025170 
NT2RI20025300 
NT2RI20025410 
NT2RI20025540 
NT2RI20025850 
NT2RI20026540 
NT2RI20028020 
NT2RI20028520 
NT2RI20029260 
NT2RI20029580 
NT2RI20029700 
NT2RI20030110 
NT2RI20030190 
NT2RI20030510 
NT2RI20030670 
NT2RI20031540 
NT2RI20032050 
NT2RI20032220 
NT2RI20033010 
NT2RI20033040 
NT2RI20033380 
NT2RI20033440 
NT2RI20033830 
NT2RI20035560 
NT2RI20036780 
NT2RI20036950 
NT2RI20037510 
NT2RI20040590 
NT2RI20041900 
NT2RI20042840 
NT2RI20043040 
NT2RI20043980 
NT2RI20044420 
NT2RI20046060 
NT2RI20047830 
NT2RI20048400 
NT2RI20049160 
NT2RI20049840 
NT2RI20049850 
NT2RI20050610 
NT2RI20050870 
NT2RI20051500 
NT2RI20053350 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



505.. 1482 
1..1350 
1165..1629 
532.. 1401 
90 .1727 
570..>2759 
15. .1757 
170..>2080 
245.. 181 6 
191. .1555 
1 ..420 
120..692 
1253 .1714 
350..2023 
312..881 
248..862 
43.. 357 
1317.. 1715 
714..1046 
84.. 1538 
698..2797 
640..>2800 
652. .2898 
408. .794 
121. .1374 
650.. 1687 
165.. 929 
27.. 1883 
51 3. .2585 
355.. 2445 
591 ..1589 
605. 1597 
86 .442 
760.. 1083 
236.. 1867 
76.. 549 
1080. .1454 
782.. 1480 
297.623 
16.. 324 
668.. 1054 
1266. .2060 
783.. 1973 
19..2301 
101 ..2056 
236.. 1246 
128..2125 
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Table 1 (continued) 



"NT2RI 26053680 " 
NT2RI20055640 
NT2RI20056280 
NT2RI20056470 
NT2RI20057230 
NT2RI20058110 
NT2RI20058510 
NT2RI20060710 
NT2RI20060720 
NT2RI20061270 
NT2RI20061830 
NT2RI20062100 
NT2RI20063450 
NT2RI20064120 
NT2RI20064870 
NT2RI20065060 
NT2RI20065530 
NT2RI20066670 
NT2RI20066790 
NT2RI20066820 
NT2RI20067030 
NT2RI20067350 
NT2RI20067880 
NT2RI20068250 
NT2RI20068550 
NT2RI20070480 
NT2RI20070840 
NT2RI20070960 
NT2RI20071160 
NT2RI20071330 
NT2RI20071480 
NT2RI20072140 
NT2RI20072540 
NT2RI20073030 
NT2RI20073840 
NT2RI20073860 
NT2RI20074390 
NT2RI20074690 
NT2RI20074980 
NT2RI20075070 
NT2RI20075720 
NT2RI20075890 
NT2RI20077230 
NT2RI20077290 
NT2RI20077510 
NT2RI20077540 
NT2RI20078270 
NT2RI20078790 
NT2RI20078840 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



3"10:.;2430"" 
68.. 1171 
1493. .1954 
272..2098 
118..>1116 
145..1566 
107.2050 
274.. 1278 
78. 2354 
14. 352 
906.. 1580 
1465. 2487 
278..745 
281 ..1321 
647. 11 38 
63..1193 
644.. 11 35 
23.. 598 
272.. 1747 
145.. 11 73 
58.. 1368 
100..>2451 
25..732 
123..2246 
250.. 1656 
483.. 1844 
25 .600 
212..>2749 
921. .1643 
159. .21 05 
82. .11 55 
845. .1192 
483.. 1394 
914..1351 
202.. 1569 
150..506 
60. .1916 
430..>2130 
175.. 1269 
1064. 1513 
193. 1758 
927.. 1520 
79. .741 
101 ..793 
455.. 766 
527.. 1075 
61. .717 
391 ..1020 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"NT2RI26d78910" 
NT2RI20080500 
NT2RI20081880 
NT2RI20082210 
NT2RI20083360 
NT2RI20083960 
NT2RI20084810 
NT2RI20085260 
NT2RI20085980 
NT2RI20086560 
NT2RI20087140 
NT2RI20087490 
NT2RI20087910 
NT2RI20088010 
NT2RI20088120 
NT2RI20089420 
NT2RI20090650 
NT2RI20090660 
NT2RI20090830 
NT2RI20091440 
NT2RI20092150 
NT2RI20092890 
NT2RI20094060 
NT2RP60000080 
NT2RP60000170 
NT2RP60000320 
NT2RP60000350 
NT2RP60000390 
NT2RP60000590 
NT2RP60000720 
NT2RP60000860 
NT2RP60001000 
NT2RP60001090 
NT2RP60001230 
NT2RP60001270 
NT2RP70000410 
NT2RP70000690 
NT2RP70000760 
NT2RP70002380 
NT2RP70002590 
NT2RP70002710 
NT2RP70003640 
NT2RP70003910 
NT2RP70004250 
NT2RP70004770 
NT2RP70005790 
NT2RP70006240 
NT2RP70008120 
NT2RP70009060 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



334" 1524""" 
229.. 1767 
158. .1180 
449.. 11 20 
60. 1898 
164.. 919 
863. 1384 
347.. 709 
357..>2020 
478.. 798 
108..443 
498.. 1862 
508.. 1011 
31 6.. 759 
10. .684 
626.. 1099 
41. .1594 
65.. 1852 
988..1713 
216..1037 
574..>2918 
651 ..2471 
17.. 1030 
2249..2614 
128..928 
27..2108 
553.. 1734 
455.. 829 
2067.2381 
1136. .2104 
7.. 660 
474.. 1694 
442.21 36 
78 .1937 
858.. 1652 
801. .1103 
259. .3270 
954.. 1295 
231. .1208 
202. 993 
26A.. 1619 
2617..2994 
1492. .1797 
261 ..1115 
1132. .2541 
940.. 1257 
26.. 1756 
90. 476 
2197..3105 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"NT2RP70016800" 
NT2RP70011660 
NT2RP70012310 
NT2RP70013060 
NT2RP70013350 
NT2RP70015910 
NT2RP70018560 
NT2RP70021510 
NT2RP70022430 
NT2RP70023760 
NT2RP70023790 
NT2RP70024490 
NT2RP70024500 
NT2RP70025540 
NT2RP70026190 
NT2RP70028290 
NT2RP70028410 
NT2RP70028750 
NT2RP70029060 
NT2RP70029820 
NT2RP70030500 
NT2RP70030550 
NT2RP70030910 
NT2RP70032030 
NT2RP70033040 
NT2RP70036290 
NT2RP70036320 
NT2RP70036470 
NT2RP70036800 
NT2RP70039600 
NT2RP70040800 
NT2RP70042040 
NT2RP70042330 
NT2RP70042600 
NT2RP70043730 
NT2RP70043960 
NT2RP70045410 
NT2RP70046560 
NT2RP70046870 
NT2RP70047510 
NT2RP70047660 
NT2RP70047900 
NT2RP70049150 
NT2RP70049250 
NT2RP70049750 
NT2RP70052050 
NT2RP70052190 
NT2RP70054680 
NT2RP70054930 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



T32".2060" 
5..3619 
8.. 1756 
1533. 2693 
396..2096 
14.. 1492 
167.. 1561 
209..538 
269..2617 
92. 3292 
1022. 2797 
223..888 
333.. 1652 
191 ..532 
1277..>3059 
52.21 69 
363..983 
69.. 1226 
283..2847 
199..2325 
58. 480 
148.. 1854 
21 86, .2785 
214.. 1584 
294.. 11 66 
490., 2808 
550.. 1002 
340 ,3312 
151 ,,2307 
254„3121 
592,, 1587 
855.. 1946 
434..2908 
428.2539 
312..2828 
17.3181 
82..>3154 
41. 2593 
47, .2995 
29, .394 
958,, 1332 
1553,. 1870 
436..2349 
100.. 1356 
1483. .2013 
2014..2481 
24.. 41 3 
1192..1563 
214..579 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"NT2RP70055620" 
NT2RP70055130 
NT2RP70055200 
NT2RP70061620 
NT2RP70061880 
NT2RP70062960 
NT2RP70063040 
NT2RP70063740 
NT2RP70064080 
NT2RP70064900 
NT2RP70065270 
NT2RP70066210 
NT2RP70067010 
NT2RP70069800 
NT2RP70069860 
NT2RP70071140 
NT2RP70071540 
NT2RP70071770 
NT2RP70072210 
NT2RP70072520 
NT2RP70073590 
NT2RP70073810 
NT2RP70074060 
NT2RP70074220 
NT2RP70075040 
NT2RP70075370 
NT2RP70076100 
NT2RP70076170 
NT2RP70076430 
NT2RP70079250 
NT2RP70079300 
NT2RP70079750 
NT2RP70081330 
NT2RP70081370 
NT2RP70081420 
NT2RP70081440 
NT2RP70081670 
NT2RP70083150 
NT2RP70084060 
NT2RP70084410 
NT2RP70084870 
NT2RP70085500 
NT2RP70085570 
NT2RP70086230 
NT2RP70087200 
NT2RP70088550 
NT2RP70090120 
NT2RP70090190 
NT2RP70091490 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



774".Tf42""" 
175.. 1791 
1388. .1879 
1241. 251 8 
161..>3222 
604..3786 
277. 1701 
525..908 
189.. 974 
74. 2332 
261.. 2756 
1294. 1926 
342..662 
1520 .1972 
132.. 1964 
1812..2342 
694.. 1776 
28..2103 
205.. 1944 
79..>3136 
2133.2525 
171 ..683 
248..739 
367..726 
365..>3111 
253.. 1710 
280..2415 
74.1717 
461. .2788 
365. .3256 
240.. 554 
954. .2867 
187.. 1458 
279.2876 
1246, .1719 
1013, .1501 
47 ,3034 
118.,>3979 
47. 847 
115..>3256 
70. 1422 
170..3274 
302..2035 
799.. 1203 
211.. 2583 
57.. 1529 
52. 2397 
1358.. 2545 
18..764 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"NT2RP7009T680"" 
NT2RP70092150 
NT2RP70092360 
NT2RP70092590 
NT2RP70093220 
NT2RP70093630 
NT2RP70093700 
NT2RP70093730 
NT2RP70093940 
NT2RP70093970 
NT2RP70094290 
NT2RP70094660 
NT2RP70094810 
NT2RP70094980 
NT2RP70095020 
NT2RP70095070 
NT0NG1 0000330 
NT0NG1 0000520 
NT0NG1 0000980 
NT0NG1 0001 230 
NT0NG1 0001 300 
NT0NG1 0001 820 
NTONG10002140 
NTONG1 0002460 
NTONG1 0002570 
NTONG1 0002640 
NTONG20002650 
NTONG20003340 
NTONG20003630 
NTONG20004920 
NTONG20005830 
NTONG20008000 
NTONG20008780 
NTONG20009660 
NTONG20009850 
NTONG20011370 
NTONG20012220 
NTONG20014280 
NTONG20015500 
NTONG20016120 
0CBBF1 0000420 
0CBBF1 0000670 
0CBBF1 0000860 
0CBBF1 0000910 
0CBBF1 0001 040 
0CBBF1 00011 80 
OCBBF10001190 
OCBBF1 0001 220 
OCBBF20000130 



SEQ ID NO. of nucleotide 

sequence 

1000" 
1001 



1005 
1006 
1007 



1040 
1041 



1044 
1045 
1046 
1047 
1048 



Position of CDS 

"2064:26T8 
5..481 
116..>3870 
148..2256 
333.. 2783 
1372.. 1710 
124.2220 
212..2389 
135.. 2909 
830.. 1705 
705.. 1046 
76. 393 
94.3705 
38 .2905 
2451 .2768 
228..638 
9..2423 
83.. 1501 
576.. 1703 
318.. 1967 
223.. 1839 
677. .1558 
14, .1207 
242. .1561 
107.. 658 
263.. 21 31 
177. .2696 
322.996 
183..>2114 
1693, .1998 
268.. 624 
160.. 1503 
264.. 1829 
12, .1199 
32. 388 
123.. 440 
109..447 
47.. 367 
388.. 1416 
27 .2267 
457.894 
694.. 11 49 



869.. 1288 
108..977 
171 ..2444 
235..2259 
211.. 2301 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



OCBBF2d00"l260 " 

OCBBF20002310 

OCBBF20002770 

OCBBF20002870 

OCBBF20007190 

OCBBF20008240 

OCBBF20009040 

OCBBF20009980 

OCBBF20010750 

OCBBF20011010 

OCBBF20011240 

OCBBF20011400 

OCBBF20011760 

OCBBF20012100 

OCBBF20013070 

OCBBF20014020 

OCBBF20014080 

OCBBF20014940 

OCBBF20015270 

OCBBF20015280 

OCBBF20015860 

OCBBF20017060 

PANCR1 0000210 

PANCR1 0001 850 

PEBLM1 0000290 

PEBLM1 0000340 

PEBLM1 0000680 

PEBLM1 0001440 

PEBLM10001800 

PEBLM20000300 

PEBLM20001120 

PEBLM20001260 

PEBLM20001470 

PEBLM20002130 

PEBLM20002480 

PEBLM20002700 

PEBLM20003080 

PEBLM20003950 

PEBLM20004790 

PLACE50000370 

PLACE50000580 

PLACE50000670 

PLACE50000680 

PLACE50000800 

PLACE50001050 

PLACE50001130 

PLACE50001530 

PLACE50001700 

PLACE60000440 



SEQ ID NO. of nucleotide 

sequence 



" 1049"" 
1050 
1051 
1052 
1053 
1054 
1055 
1056 



1060 
1061 
1062 



Position of CDS 



■l"7"33:2l"0"7" " 
302. 1870 
19.. 1374 
1874..>2191 
457..2277 
46.21 69 
273..2030 
152.526 
221. .655 
1..1323 
313..>2823 
144..>3731 
139..1815 
107..1840 
177..1199 
174..>2999 



539..>2338 
79..>2727 
201. .827 
1065. .1463 
42. .863 
77. .379 
1294,. 1722 
12. .1814 
1330. 1923 
154..2799 
1072. .1509 
1538. .2563 
175.. 2934 
2535. .291 8 
306. .698 
35 .979 
162.. 1334 
170,. 1762 
669.. 1685 
285,, 881 
341 ,,1798 
986„1912 
387. 3224 
2506. 2874 
407..2119 
488..>3266 
417..2234 
12.. 4397 
1874.2224 
80..2140 
1255.. 1575 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"PLACE60000700" 
PLACE60000800 
PLACE60001370 
PLACE60002050 
PLACE60002630 
PLACE60003710 
PLACE60003790 
PLACE60004240 
PLACE60004290 
PLACE60005230 
PLACE60005500 
PLACE60005550 
PLACE60009530 
PLACE60012810 
PLACE60012940 
PLACE60014430 
PLACE60018860 
PLACE60019230 
PLACE60019250 
PLACE60020160 
PLACE60020840 
PLACE60021020 
PLACE60021510 
PLACE60024190 
PLACE60026680 
PLACE60026920 
PLACE60026990 
PLACE60029490 
PLACE60030380 
PLACE60030940 
PLACE60031090 
PLACE60032040 
PLACE60033720 
PLACE60033990 
PLACE60037050 
PLACE60037400 
PLACE60037450 
PLACE60038500 
PLACE60040050 
PLACE60043120 
PLACE60043360 
PLACE60043960 
PLACE60043970 
PLACE60044540 
PLACE60044640 
PLACE60044910 
PLACE60046630 
PLACE60046870 
PLACE60047380 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



250". 684" 
1570. .1995 
1126. .1521 
243.. 11 75 
1574.2191 
1170.. 1517 
908. 1564 
243..845 
1792. .21 54 
449..994 
447..791 
140.. 631 
8..337 
76 .2601 
159.587 
215..1816 
94..1614 
174..524 
1562..>2068 
41. .412 
654.. 1382 
106. .600 
261. .1871 
15.. 1145 
332.. 1924 



130.477 
28..1149 
61. ,720 
650.. 1270 
110.. 448 
595.. 1053 
403.. 11 73 
485.. 11 59 
597..923 
11 2.. 558 
734.. 1264 



SEQ ID NO. of amino acid 

sequence 



"273"7"" 
2738 
2739 
2740 
2741 
2742 
2743 
2744 
2745 
2746 
2747 
2748 
2749 
2750 
2751 
2752 
2753 
2754 
2755 
2756 
2757 
2758 
2759 
2760 
2761 
2762 
2763 
2764 
2765 
2766 
2767 
2768 
2769 
2770 
2771 
2772 
2773 
2774 
2775 
2776 
2777 
2778 
2779 
2780 
2781 
2782 
2783 
2784 
2785 



1101 
1102 
1103 
1104 



1113 
1114 



1122 
1123 
1124 



1130 
1131 
1132 

1133 



1142 
1143 
1144 
1145 
1146 



15..E 



129.. 1754 
208.. 1842 
22..>2582 
319..2100 
1249.. 1572 
1108..1620 
37.. 471 
384.. 1250 
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Table 1 (continued) 



"PLACE600493f0"" 
PLACE60049930 
PLACE60050290 
PR0ST1 0001 520 
PR0ST1 0001 670 
PR0ST1 0002200 
PR0ST1 0002460 
PR0ST1 0002720 
PR0ST1 0003430 
PR0ST1 0005260 
PR0ST1 0005360 
PR0ST1 0005640 
PROST20000360 
PROST20000530 
PROST20001760 
PROST20002060 
PROST20002670 
PROST20002730 
PROST20002740 
PROST20003250 
PROST20004630 
PROST200 17390 
PROST200 17960 
PROST200 18230 
PROST20018990 
PROST20019980 
PROST20021620 
PROST20023380 
PROST20025910 
PROST20026820 
PROST20028420 
PROST20029600 
PROST20031020 
PROST20031170 
PROST20032100 
PROST20032320 
PROST20033020 
PROST20033030 
PROST20033380 
PROST20033400 
PROST20034720 
PROST20037320 
PROST20039220 
PROST20043320 
PROST20044160 
PROST20044810 
PROST20051210 
PROST20051430 
PROST20054260 



SEQ ID NO, of nucleotide 

sequence 

11"4T 
1148 



1152 
1153 
1154 
1155 
1156 
1157 
1158 
1159 
1160 
1161 
1162 
1163 
1164 
1165 



Position of CDS 



'18.320 
1324.. 1641 



1847. 2 149 
608.. 1660 



114.689 
259.. 2445 
1214. 1591 
240..2477 
139..525 
1580.. 1969 
1439. 1825 
44.1276 
1072.. 1506 
547..897 
1070.. 1477 
1557.. 1910 
96..965 
495..815 
122..505 
1401. .1829 
89.. 1447 
2024.. 2953 
1544.. 1852 
1439 .1918 
11. .559 
94., 408 
411. ,1511 
1173, ,1607 
245„856 
20, .1456 
163,1578 
82,. 1671 
2537, 3004 
1550, ,1906 
247..567 
19. .1524 
294..647 
280.. 1764 
1001. .1414 
1790. .241 6 
278.. 1843 
435.. 866 
1442.. 1759 
405..1217 
76. 540 
56..952 



SEQ ID NO. of amino acid 

sequence 



1177 
1178 
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Table 1 (continued) 



"PR6ST2d056040 " 
PROST20058800 
PROST20059190 
PROST20059430 
PROST20061960 
PROST20062600 
PROST20064500 
PROST20067370 
PROST20069880 
PROST20072370 
PROST20072890 
PROST20073170 
PROST20073890 
PROST20079740 
PROST20085160 
PROST20094830 
PUAEN1 0000570 
PUAEN10000810 
PUAEN10001610 
PUAEN1 0003220 
SALGL1 0000050 
SALGL1 0000470 
SALGL1 0000650 
SALGL1 0001 570 
SKMUS1 0000140 
SKMUS1 0000220 
SKMUS1 0000640 
SKMUS1 0001 040 
SKMUS10001180 
SKMUS10001240 
SKMUS10001290 
SKMUS1 0001 770 
SKMUS20000740 
SKMUS20001170 
SKMUS20002710 
SKMUS20003430 
SKMUS20003650 
SKMUS20003900 
SKMUS20004580 
SKMUS20004670 
SKMUS20004680 
SKMUS20007240 
SKMUS20007740 
SKMUS20008470 
SKMUS20008630 
SKMUS20009020 
SKMUS20009330 
SKMUS20009450 
SKMUS20009540 



SEQ ID NO, of nucleotide 

sequence 



1196 
1197 



1200 
1201 
1202 
1203 
1204 
1205 
1206 
1207 
1208 
1209 
1210 
1211 
1212 
1213 
1214 
1215 
1216 



Position of CDS 



T235:i"552"" 
179..493 
94.. 483 
1475.. 1792 
7..393 
145.. 2367 
146.466 
1219. .1944 
45. 2252 
139.2256 
1664. 2632 
166.. 1905 
1022. 1324 
147.527 
63.734 
58.2328 
127..2220 
318..2234 
582.. 3794 
173..946 
298..933 
11. .619 
193..543 
269.. 1282 
68.. 1234 
102.. 1367 
137..1198 
240.. 11 00 
300.. 1076 
35..>1069 
66. .749 
203. .1276 
26. .1135 
105.. 1019 
17. 1174 
163. .861 
32, .388 
135.. 1112 
166„>1905 
141 ,,443 
4..453 
107..1120 
66.. 1061 
183..524 
294.. 1727 
396.. 1631 
96.752 
796.. 11 49 
193.. 1260 



SEQ ID NO. of amino acid 

sequence 



1226 
1227 



1240 
1241 
1242 
1243 
1244 
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Table 1 (continued) 



"SKM0S200T0080" 
SKMUS20011290 
SKMUS20011470 
SKMUS20013640 
SKMUS20014920 
SKMUS20015010 
SKMUS20015430 
SKMUS20016080 
SKMUS20016310 
SKMUS20016340 
SKMUS20016620 
SKMUS20016680 
SKMUS20016710 
SKNMC1 0000070 
SKNMC10000100 
SKNMC1 00001 90 
SKNMC1 0000290 
SKNMC10001100 
SKNMC10001590 
SKNMC10001680 
SKNMC1 0002290 
SKNMC10002510 
SKNMC1 0002640 
SKNMC20000650 
SKNMC20000970 
SKNMC20002240 
SKNMC20003050 
SKNMC20003220 
SKNMC20003560 
SKNMC20005930 
SKNMC20006120 
SKNMC20010570 
SKNMC20011130 
SKNMC20015030 
SKNMC20015550 
SKNMC20015960 
SKNSH1 0000860 
SKNSH1 0001 740 
SKNSH1 0003010 
SKNSH1 0003080 
SKNSH20001510 
SKNSH20001630 
SKNSH20003470 
SMINT10000160 
SMINT1 0000390 
SMI NT1 0000420 
SMINT1 0000540 
SMI NT1 0000570 
SMINT1 0000710 



SEQ ID NO. of nucleotide 

sequence 



" 1245"" 
1246 
1247 
1248 
1249 
1250 
1251 
1252 
1253 
1254 
1255 
1256 
1257 
1258 
1259 
1260 
1261 
1262 
1263 
1264 
1265 



1272 
1273 
1274 
1275 
1276 



1284 
1285 



1289 
1290 
1291 
1292 
1293 



Position of CDS 



■2"1"0".839"""" 
116.. 1375 
255. .968 
11 7. .476 
199.. 942 
178.. 921 
53 .847 
145.. 1026 
385.. 876 
274.. >1 622 
30. .806 
196.. 1290 
30. 722 
1265. .1594 
296.. 694 



405.. 920 
720.. 1991 
1187.. 1927 
1798 .2472 
265.2565 
68.71 8 
55, .1758 
340., 2028 
797,, 1939 
154..>1168 
351. .1091 
69, .650 
363.. 1262 
1176,, 1589 
79., 2085 
98 .955 
1129.1689 
463.. 1035 
74,.>3352 
786.. 1250 
361 ..1458 
464. 1669 
389. 871 
462.776 
1087.. 1596 
390..863 
158.. 1729 
740.. 1045 
76..>2645 
97.636 
70.. 1545 
1763..>2065 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"SMrNTIOOOfOOO"" 
SMINT1 0001 030 
SMINT1 00011 80 
SMINT20000180 
SMINT20000400 
SMINT20001450 
SMINT20002270 
SMINT20002390 
SMINT20002770 
SMINT20003960 
SMINT20004000 
SMINT20005450 
SMINT20005580 
SPLEN1 0000490 
SPLEN10000910 
SPLEN 1000 1430 
SPLEN20000200 
SPLEN20000470 
SPLEN20000720 
SPLEN20001340 
SPLEN20001970 
SPLEN20002420 
SPLEN20002430 
SPLEN20002670 
SPLEN20002700 
SPLEN20003100 
SPLEN20003570 
SPLEN20004430 
SPLEN20004960 
SPLEN20005410 
ST0MA1 0000470 
ST0MA1 0000520 
ST0MA1 00011 70 
ST0MA1 0001 330 
ST0MA1 0001 860 
STOI\/IA20000320 
STOMA20000880 
STOMA20001210 
STOMA20001880 
STOMA20002570 
STOMA20002890 
STOMA20003960 
STOMA20004780 
STOMA20004820 
SYN0V1 0001 280 
SYN0V1 0001 640 
SYNOV20001770 
SYNOV20002910 
SYNOV20008200 



SEQ ID NO. of nucleotide 

sequence 



1294 
1295 



1300 
1301 



1307 
1308 
1309 
1310 
1311 
1312 
1313 
1314 
1315 
1316 
1317 
1318 
1319 
1320 
1321 
1322 
1323 
1324 
1325 
1326 
1327 
1328 
1329 



1333 
1334 
1335 
1336 
1337 
1338 
1339 
1340 
1341 
1342 



Position of CDS 



27'.470 
415..2226 
1176. .1487 
1466.. 1879 
243..548 
11. .718 
1427. 1819 
517..885 
220.. 11 67 
5.. 2851 



1001. .1387 
1700. .2029 
27.1364 
742.. 1347 
165..695 
1974.. 2282 
1030. 1560 
808.. 1842 
61. .1146 
414.. 1310 
661. .1191 
344.. 763 
1638,.>3316 
69,. 440 
1001, .1321 
928..2322 
509..817 
1792..2175 
12. 824 
135. .533 
1790, .2170 
196..1818 
410.. 1858 
364.. 1476 
943.. 1278 
1825, .2286 
196.. 1458 
1068, .1757 
429.. 746 
187.801 
591 ..1748 
129..554 
206.. 1228 
111.. 2084 
206.. 1033 
359..673 
1085. 1750 
461 ..>2203 



SEQ ID NO. of amino acid 

sequence 
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Table 1 (continued) 



"SYN6V200T0l46" 
SYNOV20011440 
SYNOV20013740 
SYNOV20014510 
SYNOV20014570 
SYNOV20016480 
TEST1 10000230 
TEST1 10000250 
TEST1 10000420 
TEST1 10000510 
TEST1 10000550 
TEST1 10000640 
TEST1 10000700 
TEST1 10000960 
TEST1 10001 250 
TEST1 10001 270 
TESTI10001310 
TEST1 10001 380 
TEST1 10001 630 
TEST1 10001 680 
TEST1 10001 790 
TESTI10001910 
TESTI20000180 
TESTI20000440 
TESTI20001200 
TESTI20001540 
TESTI20001770 
TESTI20001790 
TESTI20001840 
TESTI20002070 
TESTI20002080 
TESTI20002380 
TESTI20002530 
TESTI20003560 
TESTI20003720 
TESTI20004350 
TESTI20004620 
TESTI20005200 
TESTI20005910 
TESTI20006000 
TESTI20006270 
TESTI20006710 
TESTI20006950 
TESTI20006990 
TESTI20007070 
TESTI20007620 
TESTI20007840 
TESTI20008190 
TESTI20008300 



SEQ ID NO. of nucleotide 

sequence 



" 1343"" 
1344 
1345 
1346 
1347 
1348 
1349 
1350 
1351 
1352 
1353 
1354 
1355 
1356 
1357 
1358 



Position of CDS 



■7"2".458""" 
275.. 1468 
147.. 1472 



132.. 656 
1101. .1415 
167.. 2032 
175.. 1863 
75. 2297 
46. .1185 
106.. 1950 
225.. 2042 
274. 978 
1027. 1641 
208..1581 
38.. 1549 
732.. 1868 
249.. 1298 
134..1390 
1577.1936 
935.. 1786 
234..563 
166..2238 
664.. 1059 
230.. 1684 
1209.2447 
154..747 
16.. 1962 
1161. .1631 
1784. 2425 
528.. 1859 
289.. 1071 
654. 1013 
578.. 11 44 
97. 726 
91 6.. 1833 
192.. 1379 
30, .1469 
645., 1682 
72..>2174 
334..645 
68..>2012 
134..2155 
56.. 1084 
125..961 
36. 3281 
820.. 1236 
323.. 11 86 



SEQ ID NO. of amino acid 

sequence 



"298"2"" 
2983 
2984 



2991 
2992 
2993 
2994 
2995 
2996 
2997 



3000 
3001 
3002 
3003 
3004 
3005 
3006 
3007 
300B 
3009 
3010 
3011 
3012 
3013 
3014 
3015 
3016 
3017 
3018 
3019 
3020 
3021 
3022 
3023 
3024 
3025 
3026 
3027 
3028 



1390 
1391 
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Table 1 (continued) 



"TESTI200"08490 " 

TESTI20008830 

TESTI20009090 

TESTI20009510 

TESTI20009700 

TESTI20010080 

TESTI20010490 

TESTI20010820 

TESTI20011340 

TESTI20011410 

TESTI20011800 

TESTI20012370 

TESTI20012690 

TESTI20013060 

TESTI20013300 

TESTI20013450 

TESTI20013520 

TESTI20014120 

TESTI20014200 

TESTI20015110 

TESTI20015120 

TESTI20015560 

TESTI20015930 

TESTI20016210 

TESTI20016610 

TESTI20016650 

TESTI20016710 

TESTI20017580 

TESTI20017660 

TESTI20017920 

TESTI20018150 

TESTI20018260 

TESTI20018270 

TESTI20018290 

TESTI20018520 

TESTI20018620 

TESTI20018690 

TESTI20018790 

TESTI20018980 

TESTI20019500 

TESTI20019680 

TESTI20019910 

TESTI20020020 

TESTI20020480 

TESTI20020570 

TESTI20020810 

TESTI20020900 

TESTI20021050 

TESTI20021490 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



59" .1333" 
1131. .1718 
562. .894 
225..1115 
84. 1628 
1565..>2060 
730.2349 
1609. .1926 
243.. 1346 
425..2725 
228.. 11 78 
257..2071 
660..2603 
125.427 
22..>2563 
187..1917 
334..759 
185.. 1384 
214.. 1299 
61. .2058 
565. 1287 
400. .1326 
1157.. 1468 
1121. .1474 
280.. 3345 
21 90.. 2549 
1071 .2003 
143.676 
317..652 
84. 890 
815. .1993 
310,. 1113 
19. .1899 
337..>2156 
46..2211 
1355. .1729 
11 78.. 2425 
481. .1821 
146. .559 
126.. 1403 
1168. .1509 
345., 1601 
1547..>1884 
255.569 
217.. 1317 
194.. 1498 
81. .1547 
68.2689 
45.. 1814 



SEQ ID NO. of amino acid 

sequence 



1395 
1396 
1397 



1403 
1404 
1405 
1406 
1407 



1418 
1419 
1420 
1421 

1422 
1423 



1438 
1439 
1440 
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Table 1 (continued) 



"TESTI20022230 " 

TESTI20022450 

TESTI20022510 

TESTI20022560 

TESTI20022640 

TESTI20022940 

TESTI20023610 

TESTI20023690 

TESTI20024150 

TESTI20024230 

TESTI20024610 

TESTI20024650 

TESTI20024670 

TESTI20024980 

TESTI20025160 

TESTI20025440 

TESTI20025800 

TESTI20026320 

TESTI20026760 

TESTI20026980 

TESTI20027000 

TESTI20027070 

TESTI20027290 

TESTI20027890 

TESTI20028060 

TESTI20028400 

TESTI20028660 

TESTI20029120 

TESTI20029650 

TESTI20030050 

TESTI20030370 

TESTI20030590 

TESTI20030710 

TESTI20030740 

TESTI20031090 

TESTI20031170 

TESTI20031300 

TESTI20031520 

TESTI20031930 

TESTI20031960 

TESTI20032280 

TEST1 20032550 

TESTI20032800 

TESTI20032990 

TESTI20033250 

TESTI20033270 

TESTI20033540 

TESTI20033560 

TESTI20033760 



SEQ ID NO. of nucleotide 

sequence 



1441 
1442 



1445 
1446 
1447 



Position of CDS 



205".783"""" 
59.. 1333 
97..2217 
17.. 2494 
260..913 
41.. 469 
365.. 949 
336.. 11 60 
400.. 1080 
125.. 1339 
181. .1605 
45.. 1829 
1375. .1845 
261 ..1889 
133.. 11 64 
293..1819 
60.. 1022 
31. .414 
830.. 1663 
21 2.. 1354 
22 .573 
88, .1410 
158.. 1333 
104..>1506 
1596, .1970 
740.. 1066 
1111. .1995 
295.. 1920 
391. .1446 
21, .800 
271. .1368 
1028, .1474 
88 .666 
85. 2055 
28..>2034 
188.. 1660 
157.. 2004 
150.. 1862 
474.. 1010 



242.. 670 
1445,. 1873 
49..411 
98. .2041 
48..722 
311. .1930 
193.. 1296 
294..659 



SEQ ID NO. of amino acid 

sequence 



1462 
1463 
1464 
1465 



1470 
1471 
1472 



1485 
1486 
1487 
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Table 1 (continued) 



"TESTI200"34130" 

TESTI20034180 

TESTI20034190 

TESTI20034980 

TESTI20035120 

TESTI20035410 

TESTI20035510 

TESTI20035740 

TESTI20035800 

TESTI20035890 

TESTI20036250 

TESTI20036490 

TESTI20037270 

TESTI20037810 

TESTI20038940 

TESTI20039140 

TESTI20039980 

TESTI20040000 

TESTI20040310 

TESTI20041110 

TESTI20041220 

TESTI20042070 

TESTI20042290 

TESTI20042430 

TESTI20042870 

TESTI20042950 

TESTI20047120 

TESTI20049290 

TESTI20049820 

TESTI20049940 

TESTI20051550 

TESTI20052680 

TESTI20053960 

TESTI20054080 

TESTI20054920 

TESTI20055840 

TESTI20056900 

TESTI20057310 

TESTI20057420 

TESTI20058600 

TESTI20062380 

TESTI20062550 

TESTI20064250 

TESTI20064830 

TESTI20065720 

TESTI20067740 

TESTI20068660 

TESTI20068720 

TESTI20069780 



SEQ ID NO, of nucleotide 

sequence 



" 1490"" 
1491 
1492 
1493 
1494 
1495 
1496 



1501 
1502 
1503 
1504 
1505 
1506 
1507 
1508 
1509 
1510 
1511 
1512 
1513 
1514 
1515 
1516 
1517 
1518 
1519 
1520 
1521 



Position of CDS 



■l"0"9".;244"4"" 
1113. .1484 
120.. 1520 
265.. 11 85 
321. .2585 
735.. 1208 
734.. 1741 
41. .1651 
1263. .1583 
206.. 1507 
45. .221 6 
88. 459 
33. .1391 
128.520 



SEQ ID NO. of amino acid 

sequence 



116( 



1536 
1537 
1538 



73.. 1530 
2..514 
291 ..881 
126.. 1463 
143..502 
33.. 1703 
95.. 1903 
647.. 1711 
1227.. 1535 
1697. 2263 
187. ,1887 
112. 1002 
402.. 1226 
157..1875 
50.. 550 
79..501 
60.. 1247 
402.2522 
108.929 
137..568 
210..1214 
124.. 1788 
287.. >1 853 
164.. 1636 
430.. 11 67 
719.. 1567 
511. .837 
537.. 1436 
373.. 1614 
222.. 644 
382.. 1239 
87.1337 
378.. 1331 
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Table 1 (continued) 



"TESTI20069790 

TESTI20071830 

TESTI20073580 

TESTI20074020 

TESTI20074640 

TESTI20074660 

TESTI20074800 

TESTI20076130 

TESTI20077490 

TESTI20077500 

TESTI20078140 

TESTI20078640 

TESTI20078670 

TESTI20078720 

TESTI20079510 

TESTI20080200 

TESTI20080330 

TESTI20081390 

TESTI20081440 

TESTI20082340 

TESTI20082400 

TESTI20083430 

TESTI20083870 

TESTI20084400 

TESTI20086570 

TESTI20087740 

TESTI20088470 

TESTI201 36910 

TESTI201 38320 

TESTI201 40360 

TESTI201 77400 

TESTI30000020 

THYMU 10000020 

THYMU 10000320 

THYMU1 0000830 

THYMU1 0001 050 

THYMU1 0001 760 

THYMU1 0002910 

THYMU 10003290 

THYMU 10003590 

THYMU1 0003660 

THYMU 10003820 

THYMU 10004590 

THYMU 10004730 

THYMU1 0004910 

THYMU 10005270 

THYMU 10005580 

THYMU20001400 

THYMU20002360 



SEQ ID NO. of nucleotide 

sequence 



Position of CDS 



SEQ ID NO. of amino acid 

sequence 



1542 
1543 
1544 
1545 



1553 
1554 
1555 



1559 
1560 
1561 
1562 
1563 
1564 
1565 
1566 
1567 
1568 
1569 
1570 



1583 
1584 
1585 
1586 
1587 



3.. 1394 
1097.. 1426 
204..>1824 
124.. 1683 
81. .2000 
122..>2107 
852.. 1598 
391. .1722 
897.. 1400 
287.. 802 
31 8.. 1952 
92. .1309 
824.3025 
153.. 2429 
72.. 497 
118.. 1839 
147..545 
931. .1419 
409.834 
157..2979 
31.. 652 
1749. 2078 
246.. 1289 
14, .1900 
362. .757 
1667 .2026 
98.. 1060 
831. .1349 
70, .1584 
97, .1914 
286.. 1131 
1299, .1691 



116S 



632.. 934 
1..492 
1598, .2026 
22, 534 
743..2005 
1749, .2084 
1242.. 1601 
594.. 11 42 
421 ..735 
301 ..1119 
1706..>2057 
896..1819 
42..410 
2..385 
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Table 1 (continued) 



"THYM020d03"l70" 

THYI\/IU20003690 

TRACH1 00001 80 

TRACH1 0000300 

TRACH1 0000570 

TRACH1 0000630 

TRACH 10000740 

TRACH1 0001 000 

TRACH1 0001 060 

TRACH1 0001250 

TRACH1 0001400 

TRACH20000150 

TRACH20000790 

TRACH20001850 

TRACH20001960 

TRACH20002350 

TRACH20002370 

TRACH20002500 

TRACH20002890 

TRACH20003930 

TRACH20004110 

TRACH20004200 

TRACH20004610 

TRACH20004720 

TRACH20004960 

TRACH20004970 

TRACH20006650 

TRACH20006750 

TRACH20007670 

TRACH20007800 

TRACH20008940 

TRACH20008980 

TRACH20009260 

TRACH20009440 

TRACH20011920 

TRACH20012890 

TRACH20013950 

TRACH20014000 

TRACH20015920 

TRACH20016070 

UMVEN10001220 

UMVEN20001330 

UTERU1 0000770 

UTERU1 0000960 

UTERU1 0001 600 

UTERU1 0001 920 

UTERU20000470 

UTERU20003380 

UTERU20003930 



SEQ ID NO, of nucleotide 

sequence 



Position of CDS 



■703:.T04"1 
328.. 1782 
63. .1205 
384..902 
722.. 1039 
431 ..1372 
80. 1672 
181 ..660 
744.. 1121 
21. .1610 
266..622 
1303.. 1755 
47.. 385 
19..750 
144..974 
1123..1437 
197..1471 
120.. 1682 
895..2022 
1403..>2562 
150.. 1844 
80..>2895 
498..2084 
435.. 1940 
115.. 1842 
1083.. 1517 
208.. 1866 
232. 1200 
11 47.. 1452 
492.. 1937 
701. .1957 
350.. 661 
75 .770 
879.. 1235 
530..>2034 
143..715 
1697, .2602 
1626, .1949 
569. .877 
484.. 11 76 
698.. 1339 
164..2242 
2074.. 241 8 
952.. 1593 
527.. 1747 
11 2.. 474 
1691..>2197 
130..816. 
514.. 1101 



SEQ ID NO. of amino acid 

sequence 



"322"7"" 
3228 
3229 
3230 
3231 
3232 
3233 
3234 
3235 
3236 
3237 
3238 
3239 
3240 
3241 
3242 
3243 
3244 
3245 
3246 
3247 
3248 
3249 
3250 
3251 
3252 
3253 
3254 
3255 
3256 
3257 
3258 
3259 
3260 
3261 
3262 
3263 
3264 
3265 
3266 
3267 



3270 
3271 
3272 
3273 
3274 
3275 



1591 
1592 



1606 
1607 



1615 
1616 
1617 
1618 
1619 
1620 
1621 
1622 



1627 
1628 
1629 



1634 
1635 
1636 
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Table 1 (continued) 



SEQ ID NO. of nucleotide 
sequence 



"UTERG2d004850 " 
UTERU20005410 
UTERU20005690 



Position of CDS 

"3857. 732" 
432..800 
217..1899 



SEQ ID NO. of amino acid 

sequence 

327"6" 
3277 
3278 



[0028] Namely, primers used to synthiesize polynucleotides can be designed based on the nucleotide sequences of 
polynucleotides of the present Invention shown In SEQ ID NOs In the above Table 1 . When one intends to synthesize 
full-length cDNAs, an oiigo dT primer can be used as the 3'-end primer. The length of the primers Is usually 15-100 
bp, and favorably between 15-35 bp. In case of LA PGR, which Is described below, the primer length of 25-35 bp may 
provide a good result. 

[0029] A method to design a primer that enables a specific amplification based on the aimed nucleotide sequence 
Is known to those skilled In the art (Current Protocols in iVIolecular Biology, Ausubel et ai. edit, (1987) John Wiley & 
Sons, Section 6.1-6.4). In designing a primer based on the 5'-end sequence, the primer Is designed so as that, in 
principle, the amplification products will include the translation start site. Accordingly, for example, when the 5'-end 
primer is designed based on the nucleotide sequence of 5' untranslated region (5'UTR), any part of the 5'-end, which 
ensures the specificity to the cDNA of interest, can be selected as the primer 

[0030] When synthesizing a full-length cDNA, the target nucleotide sequence to be amplified can extend to several 
thousand bp in some cDNA. However, it is possible to amplify such a long nucleotides by using such as LA PGR (Long 
and Accurate PGR), it is advantageous to use LA PGR when synthesizing long DNA. In LA PGR, in which a special 
DNA polymerase having 3' -> 5' exonuclease activity Is used, mislncorporated nucleotides can be removed . Accordingly 
accurate synthesis of the complementary strand can be achieved even with a long nucleotide sequence. By using LA 
PGR, it Is reported that amplification of a nucleotide with 20 kb longer can be achieved under desirable conditions 
(Takeshi Hayashi (1 996) Jikken-lgaku Bessatsu, "Advanced Technologies In PGR" Youdo-sha). 
[0031] A template DNA for synthesizing the full-length cDNAof the present Invention can be obtained by using cDNA 
libraries that are prepared by various methods. The full-length cDNA clones of the present invention are clones with 
high probability of completeness In length, which were obtained by the method comprising the steps of [1] preparing 
libraries containing cDNAs with the very high fullness ratio by ollgo-capping, and [2] assembling the 5'-end sequences 
and selecting one with the highest probability of completeness in length In the cluster formed (there are many clones 
longer in the 5'-end direction). 

[0032] However, the uses of primers designed based onthefull-iength nucleotide sequences provided by the present 
invention enable easily obtaining full-length cDNAs without such a special technique. 

[0033] The problem with the cDNA libraries prepared by the known methods or commercially available Is that mRNA 
contained in the libraries has very low fullness ratio. Thus, It Is difficult to screen full-length cDNA clone directly from 
the library using ordinary cloning methods. The present Invention has revealed a nucleotide sequence of novel full- 
length cDNA. if a full-length nucleotide sequence Is provided, It Is possible to synthesize a target full-length cDNA by 
using enzymatic reactions such as PGR. In particular, a full-length-enrlched cDNA library, synthesized by methods 
such as oligo-capping, is desirable to synthesize a full-length cDNA with more reliability. 

[0034] The 5'-end sequence of the full-length cDNA clones of the invention can be used to isolate the regulatory 
element of transcription including the promoter on the genome, A rough draft of the human genome (analysis of human 
genomic sequence with lower accuracy), which covers 90% of the genome, has been reported (Nature, Vol.409, 
81 4-823, 2001 ), and by the year 2003, analysis of the entire human genomic sequence Is going to be finished. However, 
it is hard to analyze with software the transcription start sites on the human genome, in which long introns exist. By 
contrast, it is easy to specify the transcription start site on the genomic sequence using the nucleotide sequence which 
Includes the 5'-end of the full-length cDNA clone of the present Invention, and thus It Is easy to obtain the genomic 
region Involved In transcription regulation, which Includes the promoter that Is contained in the upstream of the tran- 
scription start site. 

[0035] The polypeptide encoded by the full-length cDN A of the Invention can be prepared as a recombinant polypep- 
tide or as a natural polypeptide. For example, the recombinant polypeptide can be prepared by Inserting the polynu- 
cleotide encoding the polypeptide of the Invention Into a vector. Introducing the vector into an appropriate host cell and 
purifying the polypeptide expressed within the transformed host cell, as described below. In contrast, the natural 
polypeptide can be prepared, for example, by utilizing an affinity column to which an antibody against the polypeptide 
of the invention (Current Protocols In Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 
16.1-1 6.19) is attached. The antibody used for affinity purification may be either a polyclonal antibody, or a monoclonal 
antibody. Alternatively, in v/fratranslatlon (See, for example, "On thefldellty of mRNA translation In the nuclease-treated 
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rabbit reticulocyte lysate system." Dasso IVI.C., and Jaclcson R.J. (1989) Nucleic Acids Res. 17: 3129-3144) may be 
used for preparing the polypeptide of the Invention. 

[0036] Polypeptides functionally equivalent to the polypeptides of the present Invention can be prepared based on 
the activities, which were clarified In the above-mentioned manner, of the polypeptides of the present Invention. Using 
5 the biological activity possessed by the polypeptide of the Invention as an index, it is possible to verify whether or not 
a particular polypeptide is functionally equivalent to the polypeptide of the Invention by examining whether or not the 

polypeptide has said activity. 

[0037] Polypeptides functionally equivalent to the polypeptides of the present invention can be prepared by those 
skilled in the art, for example, by using a method for introducing mutations into an amino acid sequence of a polypeptide 

10 (for example, site-directed mutagenesis (Current Protocols In Molecular Biology, edit, Ausubel et ai., (1987) John Wiley 
& Sons, Section 8.1-8.5). Besides, such polypeptides can be generated by spontaneous mutations. The present in- 
vention also Includes a polypeptide comprising the amino acid sequence shown In Table 1 In which one or more amino 
acids are substituted, deleted. Inserted, and/or added, as long as the polypeptides have the equivalent functions to 
those of the polypeptides identified in the present Examples described later 

'5 [0038] There are no limitations on the number and sites of amino acid mutations, as long as the polypeptides maintain 
the functions thereof. The number of mutations typically corresponds to 30% or less, or 20% or less, or 10% or iess, 
preferably 5% or less, or 3% or less of the total amino acids, more preferably 2% or less or 1% or less of the total 
amino acids. Alternatively, herein, substitution of one or more amino acids includes substitution of several amino acids. 
As used herein, the term "several amino acids" means, for example, 5 amino acids, preferably 4 or 3 amino acids, 

20 more preferably 2 amino acids, and further preferably 1 amino acid. 

[0039] From the viewpoint of maintaining the polypeptide function. It Is preferable that a substituted amino acid has 
a similar property to that of the original amino acid. For example, Ala, Val, Leu, lie. Pro, Met, Phe and Trp are assumed 
to have similar properties to one another because they are all classified Into a group of non-polar amino acids. Similarly, 
substitution can be performed among non-charged amino acid such as Gly, Ser, Thr, Cys, Tyr, Asn, and Gin, acidic 

25 amino adds such as Asp and Glu, and basic amino acids such as Lys, Arg, and His. 

[0040] In addition, polypeptides functionally equivalent to the polypeptides of the present invention can be isolated 
by using techniques of hybridization or gene amplification known to those skilled in the art. Specifically, using the 
hybridization technique (Current Protocols in Molecular Biology, edit, Ausubel et al., (1987) John Wiley & Sons, Section 
6.3-6.4)), those skilled in the art can usually isolate a polynucleotide highly homologous to the polynucleotide encoding 

30 the polypeptide Identified in the present Example based on the identified nucleotide sequence (Table 1) or a portion 
thereof and obtain the functionally equivalent polypeptide from the isolated polynucleotide. The present invention in- 
clude polypeptides encoded by the polynucleotides hybridizing with the polynucleotides encoding the polypeptides 
identified in the present Example, as long as the polypeptides are functionally equivalent to the polypeptides identified 
In the present Example. Organisms from which the functionally equivalent polypeptides are Isolated are Illustrated by 

35 vertebrates such as human, mouse, rat, rabbit, pig and bovine, but are not limited to these animals. 

[0041] Washing conditions of hybridization for the Isolation of polynucleotides encoding the functionally equivalent 
polypeptides are usually "1x SSC, 0.1% SDS, 37°C"; more stringent conditions are"0.5x BSC, 0.1% SDS, 42°C"; and 
still more stringent conditions are "O.lx SSC, 0.1% SDS, 65°C". Alternatively, the following conditions can be given as 
hybridization conditions of the present invention. Namely, conditions in which the hybridization is done at "6x SSC, 

40 40% Formamide, 25°C", and the washing at "1x SSC, 55°C" can be given. More preferable conditions are those in 
which the hybridization is done at "6x SSC, 40% Formamide, 37°C", and the washing at "0.2x SSC, 55°C". Even more 
preferable are those in which the hybridization is done at "6x SSC, 50% Formamide, 37''C", and the washing at "O.lx 
SSC, 62°C". The more stringent the conditions of hybridization are, the more frequently the polynucleotides highly 
homologous to the probe sequence are Isolated. Therefore, it Is preferable to conduct hybridization under stringent 

45 conditions. Examples of stringent conditions In the present Invention are, washing conditions of "0.5x SSC, 0.1% SDS, 
42°C", or alternatively, hybridization conditions of "6x SSC, 40% Formamide, 37°C, and the washing at "0.2x SSC, 
55°C". 

[0042] One skilled In the art can suitably select various conditions, such as dilution ratios of SSC, fomiamide con- 
centrations, and temperatures to accomplish a similar stringency. 
50 [0043] However, the above-mentioned combinations of SSC, SDS and temperature conditions are Indicated just as 
examples. Those skilled in the art can select the hybridization conditions with similar stringency to those mentioned 
above by properly combining the above-mentioned or other factors (for example, probe concentration, probe length 
and duration of hybridization reaction) that determines the stringency of hybridization. 

[0044] The amino acid sequences of polypeptides Isolated by using the hybridization techniques usually have high 
55 identity to those of the polypeptides of the present Invention, which are shown in Table 1 . The present invention en- 
compasses a polynucleotide comprising a nucleotide sequence that has a high Identity to the nucleotide sequence of 
claim 1 (a). Furthermore, the present Invention encompasses a peptide, or polypeptide comprising an amino acid 
sequence that has a high identity to the amino acid sequence encoded by the polynucleotide of claim 1 (b). The term 
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"high identity" indicates sequence identity of at least 40% or more; preferably 60% or more; and more preferably 70% 
or more. Alternatively, more preferable is identity of 90% or more, or 93% or more, or 95% or more, furthermore, 97% 
or more, or 99% or more. The identity can be determined by using the BLAST search algorithm. 
[0045] As used herein, "percent identity" of amino acid sequences or nucleic acids Is determined using the algorithm 

5 BLAST of Karlin and Altschul (Proc. Natl. Acad. Sci. USA 90:5873-5877, 1993). Such an algorithm is incorporated into 
the BLASTN and BLASTX programs of Altschul et al. (J. Mol. Biol.21 5:403-410, 1990). BLAST nucleotide searches 
are performed with the BLASTN program, for example, score = 100, wordlength = 12. BLAST protein searches are 
performed with the BLASTX program, for example, score = 50, wordlength = 3. When utilizing BLAST and Gapped 
BLAST programs, the default parameters of the respective programs are used. See http://www.ncbi.nlm.nih.gov. 

10 [0046] With the gene amplification technique (PGR) (Current Protocols in Molecular Biology, edit, Ausubel et al., 
(1987) John Wiley & Sons, Section 6.1-6.4)) using primers designed based on the nucleotide sequence (Table 1) or a 
portion thereof identified in the present Example, it is possible to isolate a polynucleotide fragment highly homologous 
to the polynucleotide sequence or a portion thereof and to obtain functionally equivalent polypeptide to a particular 
polypeptide identified in the present Example based on the isolated polynucleotide fragment. 

15 [0047] The present invention also provides a polynucleotide containing at least 15 nucleotides complementary to a 
polynucleotide comprising a nucleotide sequence of SEQ ID NOs shown in Tablet orthe complementary strand thereof. 
Herein, the term "complementary strand" is defined as one strand of a double strand DNA composed of A:T and G:C 
base pair to the other strand. Also, "complementary" is defined as not only those completely matching within a contin- 
uous region of at least 15 nucleotides, but also having a identity of at least 70%, favorably 80% or higher, more favorably 

20 90% or higher, and most favorably 95% or higher within that region. The Identity may be determined using the algorithm 
described herein. 

[0048] Such a polynucleotide includes probes and primers used forthe detection and amplification of a polynucleotide 
encoding the inventive polypeptide. When used as a primer, the polynucleotide usually comprises 15 to 100 bp, and 
preferably of 15 to 35 bp. When used as a probe, the polynucleotide comprises the whole or a part of the sequence 

25 of a polynucleotide of the invention, and comprises at least 15 bp. When used as primers, such polynucleotides are 
complementary at the 3'-end, and restriction enzyme recognition sequences or tags can be added to the 5'-end. 
[0049] Furthermore, polynucleotides of the present invention include an antisense polynucleotide for suppressing 
the expression of a polypeptide of the invention, which comprises an amino acid sequence of SEQ ID NOs shown in 
Table 1 . To exert an antisense effect, an antisense polynucleotide has at least 15 bp or more, for example 50 bp or 

30 more, preferably 1 00 bp or more, and more preferably 500 bp or more, and usually has 3000 bp or less, and preferably 
2000 bp or less. Antisense polynucleotides can be used in the gene therapy of diseases caused by abnormalities of 
the polypeptides of the invention (abnormal function or abnormal expression). An antisense polynucleotide can be 
prepared, for example, by the phosphorothioate method ("Physicochemical properties of phosphorothioate oligodeox- 
ynucleotides." Stein (1 988) Nucleic Acids Res. 1 6: 3209-3221) based on the sequence information of polynucleotide 

35 encoding a polypeptide of the invention (for example, the nucleotide sequences of SEQ ID NO: 1 to 1639). 

[0050] The polynucleotides or antisense polynucleotides of the present invention can be used in, for example, gene 
therapy. As target diseases, for example, cancers or various inflammatory diseases may be preferable. These mole- 
cules can be used for gene therapy, for example, by administrating them to patients by the in vivo or ex vivo method 
using virus vectors such as retrovirus vectors, adenovirus vectors, and adeno-related virus vectors, or non-virus vectors 

40 such as liposomes. 

[0051] The present invention also includes a partial peptide of the polypeptides of the invention. The partial peptide 
comprises a polypeptide generated as a result that a signal peptide has been removed from a secretory protein. If the 
polypeptide of the present invention has an activity as a receptor or a ligand, the partial peptide may function as a 
competitive inhibitor of the polypeptide and may bind to the receptor (or ligand) In addition, the present invention 

45 includes an antigen peptide for raising antibodies. For the peptides to be specific for the polypeptide of the invention, 
the peptides comprise at least 7 amino acids, preferably 8 amino acids or more, more preferably 9 amino acids or 
more, and even more preferably 10 amino acids or more. The peptide can be used for preparing antibodies against 
the polypeptide of the invention, or competitive inhibitors of them, and also screening for a receptor that binds to the 
polypeptide of the invention. The partial peptides of the invention can be produced, for example, by genetic engineering 

50 methods, l<nown methods for synthesizing peptides, or digesting the polypeptide of the invention with an appropriate 
peptidase. 

[0052] The present invention also relates to a vector into which a polynucleotide of the invention is inserted. The 
vector of the invention is not limited as long as it contains the inserted polynucleotide stably. For example, if £. coli is 
used as a host, vectors such as pBluescript vector (Stratagene) are preferable as a cloning vector. To produce the 
55 polypeptide of the invention, expression vectors are especially useful. Any expression vector can be used as long as 
it is capable of expressing the polypeptide in vitro, in E. coii, in cultured cells, or in vivo. For example, pBEST vector 
(Promega) is preferable for in vitro expression, pET vector (Invitrogen) for £. coli, pIVIEI 8S-FL3 vector (GenBank Ac- 
cession No. AB009864) for cultured cells, and pME18S vector (Mol. Cell. Biol. (1988) 8: 466-472) for /nv/Vo expression. 
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To insertthe polynucleotide of the invention, ligation utilizing restriction sites can be performed according to the standard 
method (Current Protocols in IVIolecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .4-11.11). 
[0053] Recently, the technique of GATEWAY^" system (Invitrogen), which is an expression vector construction sys- 
tem for polypeptide expression, has been developed (Experimental Medicine, Vol. 1 8, No. 1 9 (December), p271 6-271 7, 
5 2000). This system includes two types of site-specific recombinases (BP CLONASE™ and LR CLONASE™) derived 
from lambda phage and uses BP CLONASETi^-speclflc recombination sites for an Entry Vector and LR CLONASET'^- 
specific recombination sites for a Destination Vector, which may comprise a tag useful for polypeptide purification. With 
this system, an expression vector can be obtained by using homologous recombination. 

[0054] ' First, a polynucleotide fragment of Interest Is Inserted into the entry vector using the first recombination. Then , 
10 the secondary recombination is allowed to take place between the entry vector, where the polynucleotide fragment of 
interest has been inserted, and the destination vector. Thus, the expression vector can be prepared rapidly and highly 
efficiently. With the above-mentioned typical method using restriction enzyme and ligase reactions, the step of expres- 
sion vector construction and expression of polypeptide of interest takes about 7 to 1 0 days. However, with the GATE- 
\NAY™ system, the polypeptide of Interest can be expressed and prepared in only 3 to 4 days. Thus, the system 
15 ensures a high-throughput functional analysis for expressed polypeptides (http://blotech.nlkkeibp.co.jp/netilnk/lto/gate- 
way/). 

[0055] The present invention also relates to a transformant carrying the vector of the invention. Any cell can be used 
as a host into which the vector of the invention is inserted, and various kinds of host cells can be used depending on 
the purposes. For strong expression of the polypeptide in eukaryotic cells, COS cells or CHO cells can be used, for 
20 example. 

[0056] Introduction of the vector into host cells can be performed, for example, by calcium phosphate precipitation 
method, eiectroporation method (Current Protocols in Molecular Biology (1987) Ausubel etal. edit, John Wiley & Sons, 
Section 9.1-9.9), lipofectamine method (GIBCO-BRL), or microinjection method, etc. 

[0057] Further, a polynucleotide containing at least 1 5 nucleotides comprising a nucleotide sequence of any one of 

25 the polynucleotides comprising the nucleotide sequences of SEQ ID NOs shown in Table 1 or the complementary 
strand thereof can be used not only as a primer for synthesizing full-length cDNAs but also for testing and diagnosing 
the abnormalities of the polypeptide encoded by the full-length cDNA of the present Invention. For example, by utilizing 
polymerase chain reaction (genomic DNA-PCR, or RT-PCR) using the polynucleotide of the invention as a primer, 
polynucleotide encoding the polypeptide of the invention can be amplified. It is aiso possible to obtain the regulatory 

30 region of expression In the 5'-upstream by using PGR or hybridization since the transcription start site within the genomic 
sequence can be easily specified based on the 5' -end sequence of the full-length cDNA. The obtained genomic region 
can be used for detection and/or diagnosis of the abnormality of the sequence by RFLP analysis, SSCP or sequencing, 
Especially, In the case where expression of the mRNA of the present Invention varies according to a specific disease, 
analysis of the amount of expression of the mRNA using the polynucleotide of the present invention as a probe or a 

35 primer enables detection and diagnosis of the disease. 

[0058] The present invention also relates to antibodies that bind to the polypeptide of the invention. There are no 
limitations in the form of the antibodies of the invention. They include polyclonal antibodies, monoclonal antibodies, or 
their portions that can bind to an antigen. They also include antibodies of all classes. Furthermore, special antibodies 
such as humanized antibodies and chimeric antibodies are also included. 

40 [0059] The polyclonal antibody of the Invention can be obtained according to the standard method by synthesizing 
an oligopeptide corresponding to the amino acid sequence and Immunizing rabbits with the peptide (Current Protocols 
In Molecular Biology (1987) Ausubel et al. edit, John Wiley & Sons, Section 11 .12-11 .13). The monoclonal antibody of 
the invention can be obtained according to the standard method by purifying the polypeptide expressed In E. coll, 
immunizing mice with the polypeptide, and producing a hybridoma cell by fusing the spleen cells and myeloma cells 

45 (Current Protocols in Molecular Biology (1987) Ausubel etal. edit, John Wiley & Sons, Section 11.4-11.11). 

[0060] The antibody binding to the polypeptide of the present invention can be used for purification of the polypeptide 
of the Invention, and also for detection and/or diagnosis of the abnormalities of the expression and structure of the 
polypeptide. Specifically, polypeptides can be extracted, for example, from tissues, blood, or cells, and the polypeptide 
of the invention is detected by Western blotting, immunoprecipitatlon, or ELISA, etc. for the above purpose. 

50 [0061 ] Furthermore, the antibody binding to the polypeptide of the present invention can be utilized for treating the 
diseases that associates with the polypeptide of the Invention. If the antibodies are used for treating patients, human 
antibodies, humanized antibodies, or chimeric antibodies are preferable In terms of their low antigenicity. The human 
antibodies can be prepared by immunizing a mouse whose immune system Is replaced with that of human (e.g., see 
"Functional transplant of megabase human immunoglobulin loci recapitulates human antibody response in mice" Men- 

55 dez, M.J. et al. (1997) Nat. Genet. 15: 146-156). The humanized antibodies can be prepared by recombination of the 
hypervariable region of a monoclonal antibody (Methods in Enzymology (1991) 203: 99-121). 
[0062] The use of the amino acid sequences of the polypeptides encoded by the cDNAs of the present invention 
enables predicting that the polypeptides have the following functions. It can be predict, from the results of homology 
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search of SwissProt, GenBank, UniGene, ornr, that these polypeptides have such functions. Specificaliy, for instance, 
as shown in Examples, searching for a known gene or polypeptide that Is homologous to the partial sequence of the 
full-length cDNA of the invention (1 639 clone) and referring the function of the gene and of the polypeptide encoded 
by the gene make it possible to predict the function of the polypeptide encoded by the cDNA of the invention. In this 
5 way, each of 892 clones out of the 1 639 full-length cDNA clones of the invention was predicted to encode a polypeptide 
that was classified into the following categories. 

Secretory and/or membrane protein (439 clones) 

Glycoprotein-related protein (87 clones) 
10 Signal transduction-related protein (46 clones) 

Transcription-related protein (140 clones) 

Disease-related protein (21 9 clones) 

Enzyme and/or metabolism-related protein (168 clones) 

Cell division- and/or cell proliferation-related protein (23 clones) 
'5 Cytoskeieton-related protein (60 clones) 

Nuciear protein and/or RNA syntliesis-related protein (69 clones) 

Protein synthesis- and/or transport-related protein (24 clones) 

Cellular defense-reiated protein (6 clones) 

Development and/or differentiation-related protein (19 clones) 
20 DNA- and/or RNA-binding protein (158 clones) 

ATP- and/or GTP-binding protein (63 clones) 

[0063] The functions of the polypeptides encoded by the cDNAs of the present invention can be predicted by as- 
sessing the presence of signal sequence, transmembrane region, nuclear translocation signal, glycosylation signal, 
25 phosphorylation site, and zinc finger motif, SH3 domain, etc. in the amino acid sequences. The programs, PSORT 
(Nakai K,, and Kanehisa IVI. (1992) Genomics 14: 897-911), SOSUI (Hirokawa T et al. (1998) Bioinformatics 14; 

378-379) (Mitsui Knowledge Industry), and MEMSAT (Jones D.T., Taylor W.R., and Thornton J.M. (1994) Biochemistry 
33: 3038-3049) can be used to predict the existence of the signal sequence or transmembrane region. Alternatively, 
a partial amino acid sequence of the polypeptide is fused with another polypeptide such as GFP, the fusion polypeptide 

30 is transfected into cultured cells, and the localization is analyzed to predict the function of the original polypeptide. 
[0064] Based on the determined nucleotide sequences of the full-length cDNAs obtained in the present invention, it 
is possible to predict more detailed functions of the polypeptides encoded by the cDNA clones, for example, by search- 
ing the databases such as GenBank, Swiss-Prot, UniGene, and nrfor homologies of thecDNAs; or by searching the 
amino acid sequences deduced from the full-length cDNAs for signal sequences by using software programs such as 

35 PSORT, for transmembrane regions by using software programs such as SOSUI or for motifs by using software pro- 
grams such as Pfam (http://www.sanger.ac.uk/Software/Pfam/index.shtml) and PROSITE (http://www.expasy.ch/ 
prosite/). As a matter of course, the functions are often predictable by using partial sequence information (preferably 
300 nucleotides or more) instead of the full-length nucleotide sequences. However, the result of the prediction by using 
partial nucleotide sequence does not always agree with the result obtained by using full-length nucleotide sequence, 

40 and thus, it is needless to say that the prediction of function is preferably performed based on the full-length nucleotide 
sequences. 

[0065] GenBank, Swiss-Prot, UniGene and nr databases were searched for homologies of the full-length nucleotide 
sequences of the 1639 clones (see Example 6). The amino acid sequences deduced from the full-length nucleotide 
sequences were searched for functional domains by PSORT, SOSUI and Pfam. Prediction of functions of polypeptides 
45 encoded by the clones and the categorization thereof were performed based on these results obtained. The catego- 
rization was earned out by the following method. 

[1] Firstly, the cDNA clones were classified into the above-mentioned Afunctional categories based on the results 
of annotation-based categorization (using the keywords in the case of Swiss-Prot hit data; using Definition or 
50 Reference information in the case of GenBank, UniGene, or nr hit data), and the signal sequence search of the 

deduced ORFs by PSORT and the transmembrane region search by SOSUI. 

[2] Secondly, clones which had been unassignable to the categories by the method of [1 ] were searched for func- 
tional domains and/or motifs by Pfam. Based on the results, the clones were additionally classified into the above- 
mentioned 14 types of categories when they had a functional domain and/or motif assignable to any one of the 
55 categories. 



[0066] The following 439 clones presumably belong to secretory and/or membrane proteins. 

3NB691 00001 80, 3NB691 0000850, 3NB6920000290, 3NB6920003300, 3NB6920005450, 3NB6920010020, 
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ADRGL1 00001 80, ADRGL1 0001 600, ADRGL20003230, BGGI1 2001 0970, BNGH41 0000340, BNGH41 0001 040, 
BNGH41 0001 180, BNGH41 0001 370, BNGH41 0001 980, BRACE1 0000730, BRACE1 0001 690, BRACE20002800, 
BRACE20007180, BRACE20010650, BRACE20011170, BRACE20011430, BRACE20013400, BRACE20013520, 
BRACE20014230, BRACE20014530, BRACE20014920, BRACE20015080, BRACE2001 8590, BRACE20022270, 
BRACE20024680, BRACE20026350, BRACE20026850, BRACE20030780, BRACE20031100, BRACE20034490, 
BRACE20071380, BRACE20071 970, BRACE20072810, BRACE2007401 0, BRACE20074470, BRACE20075020, 
BRACE20075380, BRACE2007641 0, BRACE20076630, BRACE20076850, BRACE2007761 0, BRACE20077640, 
BRACE20077980, BRACE20078680, BRACE20079530, BRACE20084430, BRACE20086550, BRACE20089600, 
BRACE20091880, BRAWH1 000001 0, BRAWH1 0000370, BRAWH1 0000940, BRAWH1 0001 620, BRAWH1 0001 800, 
BRAWH20001090, BRAWH20004430, BRAWH20006970, BRAWH20009840, BRAWH20011290, BRAWH2001141 0, 
BRAWH20011660, BRAWH200143B0, BRAWH20014840, BRAWH20015030, BRAWH20036930, BRAWH20038320, 
BRAWH20040950, BRAWH20052250, BRAWH20059980, BRAWH20087060, BRAWH2009261 0, CD34C20000510, 
CTONG20013660, CTONG200 15330, CTONG20028160, CTONG20037820, CTONG200471 60, DFNES20003350, 
FCBBF1 00061 80, FCBBF1 0006750, FCBBF20005910, FCBBF20007330, FCBBF20008150, FCBBF200094 
FCBBF200i53BO, FEBRA20003780, FEBRA20004040, FEBRA20004150, FEBRA20004520, FEBRA2000491 0, 
FEBRA20006560, FEBRA20006900, FEBRA20007330, FEBRA20008090, FEBRA20008800, FEBRA2001 0930, 
FEBRA20012270, FEBRA20012450, FEBRA20012940, FEBRA20013510, FEBRA20014870, FEBRA2001 4920, 
FEBRA20015840, FEBRA20020860, FEBRA20021910, FEBRA20025250, FEBRA20031 550, FEBRA20037070, 
FEBRA20041100, FEBRA20041 91 0, FEBRA20057780, FEBRA200631 50, FEBRA20066670, FEBRA20067930, 
HCASM 10000610, HCASM2Q002020, HEART20000990, HEART20004920, HHDPC20000950, HLUNG1 0000240, 
HLUNG1 0000370, HLUNG10001100, HLUNG20001160, HLUNG20001250, HLUNG20001420, HLUNG20001760, 
HLUNG20002550, HSYRA20003470, HSYRA20006290, HSYRA20008280, HSYRA20011030, HSYRA20013320, 
HSYRA20014200, HSYRA20015800, I MR32 10000440, IMR3210001580, I MR32 10002660, IMR3220007750, 
IMR3220008590, IMR3220009840, IMR3220014350, KIDNE1 0000080, KIDNE1 0001 040, KIDNE1 0001 430, 
KIDNE20000700, KIDNE20000850, KIDNE200016 KIDNE20003150, KIDNE200033 KIDNE20003490, 
KIDNE20004220, KIDNE20005170, KIDNE20005190, KIDNE20033050, KIDNE20033570, KIDNE20039410, 
KIDNE20042620, KIDNE20042950. KIDNE20044110, KIDNE20048280: KIDNE20049810, KIDNE20054000, 
KIDNE20054770, KIDNE20060530, KIDNE20060620, KIDNE20063530, KIDNE20063760, KIDNE20066520, 
KIDNE20067600, KIDNE20071 860. KIDNE20073520, KIDNE20074220, KIDNE20075690, LIVER1 0000580, 
LIVER1 0000670, LIVER1 0001 040, LIVER10001110, LIVER10001750, LIVER1 0005420, LIVER200041 60, 
MAMGL1 0000320, MAMGL1 0001840, MESAN1 0000350, MESAN1 0001 470, MESAN10001B00, MESAN20001 490, 
NB9N420000420, NHNPC20002060, NT2NE1 0000230, NT2NE1 0000830, NT2NE1 0001 630, NT2NE20003270, 
NT2NE20003920, NT2NE20004550, NT2NE20004700, NT2NE20005500, NT2NE200 12470, NT2NE20014350, 
NT2NE20016260, NT2NE20034080, NT2NE20047160, NT2NE20055170, NT2NE20057200, NT2RI20005970, 
NT2RI20009740, NT2RI20010100, NT2RI20014490, NT2RI20015400, NT2RI20015950, NT2RI2001 6570, 
NT2RI20018660, NT2RI20020220, NT2RI20021520, NT2RI20022430, NT2RI20022520, NT2RI20025300, 
NT2RI20030110, NT2RI20030510, NT2RI20031540, NT2RI20033010, NT2RI20033830, NT2RI20036780, 
NT2RI20042840, NT2RI20044420, NT2RI20049850, NT2RI20050870, NT2RI20051500, NT2RI20066820, 
NT2RI20068250, NT2RI20070480, NT2RI20070840, NT2RI20073030, NT2RI20074980, NT2RI20077540, 
NT2RI20078270, NT2RI20080500, NT2RI20081 880, NT2R12008481 0, NT2RI20085980, NT2RI20089420, 
NT2RI20092890, NT2RI20094060, NT2RP60000320, NT2RP60000390, NT2RP60001 090, NT2RP70000690, 
NT2RP70002380, NT2RP70002590, NT2RP70003640, NT2RP70011660, NT2RP7001591 0, NT2RP70021510, 
NT2RP70023760, NT2RP70023790, NT2RP70026190, NT2RP70029820, NT2RP70040800, NT2RP70043730, 
NT2RP70047900, NT2RP70049250, NT2RP70055200, NT2RP70064080, NT2RP70071540, NT2RP70071770, 
NT2RP70073810, NT2RP70074220, NT2RP70075040, NT2RP70076170, NT2RP70079250, NT2RP70079750, 
NT2RP70081330, NT2RP70081370, NT2RP70083150, NT2RP70085500, NT2RP70090120, NT2RP70091490, 
NT2RP70091680, NT2RP70092360, NT2RP70093220, NT2RP70093730, NT2RP70094290, NT2RP70094810, 
NT2RP700949B0, NT2RP70095070, NTONG1 0000980, NTONG10002140, NTONG1 0002570, NTONG20002650, 
NTONG20004920, NTONG20008000, NTONG20012220, 0CBBF1 0000420, OCBBF20002310, OCBBF20009980, 
OCBBF20012100, PANCR1 0000210, PLACE50000670, PLACE50000680, PLACE50001 050, PLACE50001130, 
PLACE60012810, PLACE6001 8860, PLACE60020160, PLACE60020840, PLACE60026990, PLACE60037050, 
PLACE60037450, PLACE60043960, PLACE60044540, PLACE600473B0, PLACE60049930, PLACE60050290, 
PR0ST1 0002200, PROST1 0002720, PR0ST1 0005260, PROST1 0005360, PROST20000360, PROST20026820, 
PROST20029600, PROST20 032320, PROST20033020, PROST20039220, PROST200441 60, PROST20051430, 
PROST20054260, PROST20058800, PROST200591 90, PROST20059430, PROST20069B80, PROST20072370, 
PROST20073890, PUAEN1 0000570, PUAEN1 0003220, SALGL1 0001 570, SKMUS20007740, SKNMC1 00001 90, 
SKNMC1 0000290, SKNMC1 0002290, SKNMC1 0002510, SKNMC20011130, SKNMC20015030, SMINT1 00001 60, 
SMINT10000 SMINT1 0000570, SMINT1 0001 180, SMINT20000180, SMINT20002770, SPLEN1 0000910, 
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SPLEN20001340, SPLEN20002430, SPLEN20002700, SPLEN20003100, SPLEN20004960, ST0MA1 0000520, 
ST0MA1 0001 170, STOMA20000320, STOMA20002570, SYNOV20001 770, SYNOV2001 6480, TESTI1 0000420, 



TEST1 10000960, 

TESTI20008830, 

TESTI20014200, 

TESTI20020810, 

TESTI20026320, 

TESTI2003193 

TESTI20049940, 

TESTI20074800, 



TESTI1 0001 270, 
TESTI20009090, 
TESTI20016210, 
TESTI20022510, 
TESTI20026980, 
TESTI20034190, 
TESTI20056900, 
TESTI20077490, 



TEST1 10001 380, 
TESTI20009700, 

TESTI20016710, 
TESTI20024230, 
TESTI20027000, 
TESTI20036490, 
TESTI20057420, 
TESTI20079510, 



TESTI20001770, 
TESTI20011340, 

TESTI20018520, 
TESTI20024650, 
TESTI20027070, 
TESTI20039980, 
TESTI20058600, 
TESTI20080200, 



TESTI20006000, 
TESTI20012370, 
TESTI20018620, 
TESTI20024670, 
TESTI20028660, 
TESTI20042870: 
TESTI20067740, 
TESTI20081440, 



TESTI20007620, 

TESTI20013520, 
TESTI20020020, 
TESTI20025800, 
TESTI20030370, 
TESTI20047120, 
TESTI20069780, 
TESTI20087740, 



TESTI20088470, TESTI201 36910, THYMU1 0000830, THYMU1 0001 760, THYMU1 0003290, THYMU1 0003820, 
THYMU 10005580, TRACH 10000630, TRACH1 0001 000, TRACH1 0001 400, TRACH20001850, TRACH20001960, 
TRACH20004200, TRACH20004960, TRACH20006650, TRACH20007670, TRACH20008980, TRACH2001 5920, 
UMVEN20001330, UTERU1 0000770, UTERU1 0000960, UTERU1 0001 920, UTERU20000470, UTERU20003930, 
UTERU20004850 

[0067] The following 87 clones presumably belong to glycoprotein-related proteins. 

BNGH41 0000340, BNGH410001180, BRACE200 14920, BRACE2001 5080, BRACE2001 8590, BRACE20024680, 
BRACE20026350, BRACE20031 1 00, BRACE20074470, BRAWH10000370^ BRAWH20001 090, BRAWH20011 660, 
BRAWH20014840, BRAWH20059980, CD34C20000510, CTONG20013660, CTONG200281 60, CTONG20037820, 
FCBBF20007330, FEBRA20007330, FEBRA20008800, FEBRA20014920, FEBRA20015840, FEBRA20057780, 
HEART20005060, HLUNG10001100, HLUNG20002550, HSYRA20013320, IMR321 0002660, IMR3220007750, 
IMR3220013320, KIDNE2004411 0, KIDNE20063760, KIDNE200676O0, KIDNE20073520, LIVER20000370, 
MESAN 10000350, NT2NE1 0000830, NT2NE10001B50, NT2NE20003270, NT2NE20016260, NT2RI2001 8660, 
NT2RI20025300, NT2RI20036780, NT2RI20077540, NT2RI20080500, NT2RI20085980, NT2RI20089420, 
NT2RI20092890, NT2RP70000690, NT2RP70004770, NT2RP70055200, NT2RP70081370, NT2RP70083150, 
NT2RP70091490, NT2RP70092360, NT2RP70094980, NT0NG1 00021 40, OCBBF2000231 0, OCBBF20002770, 
PLACE50000680, PLACE50001130, PLACE60018860, PLACE60044540, PROST2001 8230, PROST20032320, 
PROST20073890, SALGL1 0001 570, SKNMC20015030^ SMINT1 00001 60, SMINT20002770, SPLEN20001340, 
TEST1 10001 270, TESTI1 0001 380, TESTI20001 770, TESTI20024230, TESTI20027070, TESTI20036490, 
TESTI20039980, TESTI20056900, TESTI20057420, TESTI2007951 0, THYMU1 0001 760, TRACH1 0000740, 
TRACH1 0001 250, TRACH20004200, UTERU20000470 

[0068] The following 46 clones presumably belong to signal transduction-related proteins. 

ADRGL20000740, ASTR01 00001 80, BRACE20005770, BRACE20022020, BRACE20027360, BRACE20027920, 
BRAWH20006860, CTONG20005890, FEBRA20000350, HHDPC20000550, IMR3220003020, KIDNE20033730, 
KIDNE20040840, KIDNE20053360, KIDNE20062990, NT2RI20033440, NT2RI2005811 0, NT2RI20062100, 
NT2RI20073840, NT2RP70006240, NT2RP70043960, NT2RP70046870, NT2RP70061880, NT2RP70072520, 
NT2RP70081440, NT2RP70093700, NT0NG1 0001 820, PEBLM20004790, PLACE60026680, PROST20033400, 
PROST20043320, SKMUS1 0000220, SKMUS2001 6680, SPLEN20003570, TESTI20001540, TESTI20005910, 
TESTI20022560, TESTI20024980, TESTI20029120, TESTI20034980, TESTI20049820, TESTI20055840, 
THYMU 10003590, THYMU20003690, TRACH20002500, TRACH20002890 
[0069] The following 1 40 clones presumably belong to transcription-reiated proteins. 

3NB6920010220, 3NB692001511 0, 3NB692001 5570, ADRGL1 0000650, BGGI120006B40, BGGI1 20006930, 
BGGI120017140, BNGH41 0000800, BNGH420005320, BRACE 10000930, BRACE200 14550, BRACE2001 8550, 
BRACE20020910, BRACE20024090, BRACE20071740, BRAWH1 0000020, BRAWH1 0001 640, BRAWH1 0001 680, 
BRAWH20006330, BRAWH20009010, CTONG20025580, CTONG20028200, FCBBF1 0005980, FCBBF20000940, 
FCBBF20009510, FCBBF50002610, FEBRA20003970, FEBRA20003990, FEBRA20004540, FEBRA20009720, 
FEBRA20011460, FEBRA20017150, FEBRA20050140, FEBRA20064760, FEBRA20067360, FEBRA20069420, 
FEBRA20072800, HLUNG1 0000760, HLUNG20000680, HSYRA1 0001 370, HSYRA20016310, IMR32 10002420, 
IMR3220007420, KIDNE2000051 0, KIDNE20039940, KIDNE20061490, KIDNE20078110, NES0P1 0000870, 
NHNPC10001240, NHNPC20002120, NT2NE20002590, NT2NE20008090, NT2R12000341 0, NT2RI200041 20, 
NT2RI20004210, NT2Ri2001 0830, NT2RI2001 8460, NT2RI20025410, NT2RI20025850, NT2RI2006071 0, 
NT2RI20067350, NT2Ri20071 330, NT2RI20074390, NT2RI20078790, NT2RI200871 40, NT2RI20090650, 
NT2RI20092150, NT2RP60001 000, NT2RP60001 270, NT2RP70002710, NT2RP700081 20, NT2RP7001 8560, 
NT2RP70024500, NT2RP70032030, NT2RP70036290, NT2RP70042040, NT2RP7004541 0, NT2RP70046560, 
NT2RP70055130, NT2RP70061 620, NT2RP70062960, NT2RP70064900, NT2RP70069860, NT2RP70075370, 
NT2RP70085570, NT2RP70087200, NT2RP70090190, NTONG20003340, NTONG20003630, NTONG20015500, 
OCBBF20011010, OCBBF20011240, OCBBF20015860, PEBLM20002480, PEBLM20002700, PEBLM20003080, 
PEBLM20003950, PLACE60002050, PLACE60005550, PLACE60021510, PLACE60030380, PROST2001 8230, 
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PROST20031170, PROST20073170, PUAEN 10001 610, SALGL1 0000650, SKMUS1 0000640, SKMUS20014920, 
SKNMC20000650, SKNMC20 002240, SKNMC20003560, SMINT1 0001 000, SMINT20005450, SPLEN20000200, 
SPLEN20000720, SYNOV20010140, SYNOV20013740, SYNOV20014510, TESTI1 0000550, TESTI20001200, 
TESTI20007070, TESTI20010490, TESTI20015560, TESTI20018150, TESTI20018790, TESTI20021490, 
TESTI20026760, TESTI20027890, TESTI20030710, TESTI20034130, TESTI20042290, TESTI20053960, 
TESTI20074640, TESTI20074660, TESTI20078640, THYMU1 0004590, TRACH20000790, TRACH20002370, 
TRACH20009440, UTERU1 0001 600 

[0070] The following 219 clones presumably belong to disease-related proteins. 

ADRGL1 0000020, ADRGL1 0001 600, ADRGL20000740, ASTR0200041 70, BGGI1 20006840, BGGI1 2001 0970, 
BGGI120017140, BNGH41 0001 770, BNGH420005320, BRACE1 0001 870, BRACE20006980, BRACE200071 80, 
BRACE20014550, BRACE20018550, BRACE2001B590, BRACE20027550, BRACE20027720, BRACE20076850, 
BRACE200B6550, BRAWH1 0000020, BRAWH1 0001 640, BRAWH20001770, BRAWH20005030, BRAWH20005220, 
BRAWH20006330, BRAWH20006B60, BRAWH20009840, BRAWH20011660, CD34C20000510, CTONG20005890, 
CTONG20019110, CTONG20024180, CTONG2002558G, CTONG20037820, CTONG20055530, FCBBF20000940, 
FCBBF20009510, FCBBF40002820, FEBRA20001 050, FEBRA20003990, FEBRA20004150, FEBRA20004540, 
FEBRA20009720, FEBRA2001 0930, FEBRA2001 1 460, FEBRA20050790, FEBRA20057880, FEBRA20064760, 
FEBRA20067930, FEBRA200701 70, FEBRA2007551 0, FEBRA20075660, HCASM20002140, HEART20004480, 
HLUNG10001050, HLUNG20000680, HSYRAI 0001 370, HSYRA20006400, HSYRA20013320, HSYRA2001 631 0, 
IMR321 0000440, IIVIR322Q0Q791 0, KIDNE1 0001 040, KIDNE20003150, KIDNE20033730, KIDNE20042950, 
KIDNE20044110, KIDNE20050420, KIDNE20059080, KIDNE20063760, KIDNE20078110, LIVER1 0002300, 
LIVER1 0004330, LIVER20000330, LIVER20000370, MAMGL1 0001 780, MESAN1 0001 BOO, l\/IESAN20002910, 
MESAN20005010, NB9N41 0001 350, NHNPC10000840, NHNPC20002120, NT2N El 0000730, NT2NE20002990, 
NT2NE20003690, NT2NE20005170, NT2NE20005360, NT2NE200065B0, NT2NE20008090, NT2NE20013720, 
NT2NE20016340, NT2NE20055170, NT2RI20004120, NT2RI20004210, NT2RI20010910, NT2RI20014500, 
NT2RI20020410, NT2RI20029580, NT2RI20031540, NT2RI20033440, NT2RI20041900, NT2RI20056470, 
NT2RI20057230, NT2RI20067030, NT2RI20070960, NT2RI20074980, NT2RI20077540, NT2RI20080500, 
NT2RI200B3960. NT2RI20084810, NT2RI20092150, NT2RI20092890, NT2RP60000350, NT2RP60001 000, 
NT2RP60001230, NT2RP70000690, NT2RP70004250, NT2RP70028750, NT2RP70029060, NT2RP70032030, 
NT2RP70036290, NT2RP70042600, NT2RP70046560, NT2RP70049250, NT2RP70055020, NT2RP70062960, 
NT2RP70063040, NT2RP70065270, NT2RP70069860, NT2RP70071 770, NT2RP7007381 0, NT2RP70074220, 
NT2RP70075370, NT2RP70079250, NT2RP70081440, NT2RP70090120, NT2RP700901 90, NT2RP70093220, 
NT2RP70094980, NT0NG1 0002460, NTONG20003630, NTONG20015500, OCBBF10001180, OCBBF20008240, 
PEBLM1 0000340, PEBLM20002480, PEBLM20003080, PEBLM20003950, PLACE50000800, PLACE60002050, 
PLACE60003790, PLACE60014430, PR0ST1 0001 670, PR0ST1 0005360, PROST20002730, PROST20032320, 
PROST20033400, PROST20062600, PROST20072890, PROST20073890, PROST20085160, SALGL10001570, 
SKMUS 100001 40, SKMUS10001180, SKMUS1 0001 290, SKMUS20000740, SKMUS20003900, SKMUS20007240, 
SKIVIUS20016340, SKNMCI 0002510, SKNMC20000650, SKNIVIC20003220, SMINTI 0000420. SIVIINT1 0000570, 
SIVIINT1 0001 000, SIVIINT1 0001 030, SIVIINT20004000, SPLEN10001430, SPLEN20001 970, STOMA20000880, 
STOMA20003960, SYNOV20013740, SYNOV20014510, SYNOV20016480, TEST1 10001 270, TESTI10001310, 
TESTI20001200, TESTI20001 770, TESTI20002530, TESTI20006000, TESTI20006990, TESTI20007620, 
TESTI20008830, TESTI20011800, TESTI2001 2690, TESTI20015120, TESTI2001 8520, TESTI2001 8790, 
TESTI20021490, TESTI200251 60, TESTI20027070, TESTI20027290, TESTI20029120, TESTI20033250, 
TESTI20049820, TESTI20053960, TESTI20068660, TESTI20071 830, TESTI20074640, TESTI2007951 0, 
TESTI20086570, TESTI201 40360, THYMU1 0000830, THYMU1 0001 760, THYMU1 0003590, THYMU1 0004910, 
TRAGH20002370, UTERU1 0000960, UTERU20000470 

[0071 ] The following 1 68 clones presumably belong to the category of enzymes and/or metabolism-related proteins. 
3NB6920002810, ADRGL10001600, ADRGL1 0001 650, BGGI120005330, BNGH41 0000340, BNGH41 0001 770, 
BRAGE1 0000420, BRACE20015080, BRACE20022020, BRACE20024680, BRACE20026850, BRACE20027360, 
BRACE20027720, BRACE20027920, BRACE20071 380, BRACE20084430, BRAWH20001 770, BRAWH20006510, 
BRAWH200068e0, BRAWH20009840, BRAWH20011660, BRAWH20014180, BRAWH20014840, BRAWH20036890, 
BRAWH20059980, BRAWH20069890, BRAWH20089560, CTONG2001 3660, CTONG20019110, DFNES200021 20, 
FCBBF20007330, FCBBF200 15380, FEBRA20000350, FEBRA20001290, FEBRA2000311 0, FEBRA20024420, 
FEBRA20041100, FEBRA20045920, FEBRA20050790, FEBRA200521 60, FEBRA20062700, FEBRA200631 50, 
HEART20000350, HHDPC20000550, HHDPC20004550, HLUNG1 0001 050, HLUNG20002550, HSYRAI 0001 680, 
HSYRA20005100, HSYRA20015740, IMR3220008380, IMR32200091 90, IIVIR3220012180, IIVIR322001 31 70, 
KIDNE20000410, KIDNE20003490, KIDNE20004220, KIDNE20005130, KIDNE20033050, KIDNE20040840, 
KIDNE20046810, KIDNE20056290, KIDNE20060530, KIDNE20063760, KIDNE20068800, KIDNE20073280, 
KIDNE20073520, KIDNE20078100, LIVER1 0000670, LIVER1 0002300, MAMGL1 0001 780, l\/IESAN20002910, 
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MESAN20005010, NT2NE1 0000730, NT2NE1 0001 850, NT2NE20002140, NT2NE20003270, NT2NE20003690, 
NT2NE20005860, NT2NE20013720, NT2NE200 16340, NT2NE200 16660, NT2RI1 0000480, NT2RI20010100, 
NT2RI20015400, NT2RI20020220, NT2RI20025300, NT2RI20033010, NT2RI20036780, NT2RI20037510, 
NT2RI20051500, NT2RI20068550, NT2RI20073840, NT2RI20074980, NT2RI20084810, NT2RI2008791 0, 

5 NT2RP70004770, NT2RP70006240, NT2RP70011 660, NT2RP700261 90, NT2RP70062960, NT2RP70072520, 
NT2RP70076100, NT2RP70081440, NT2RP70084060, NT2RP70085570, NT2RP70093700, NT0NG1 0001 820, 
OCBBF20008240, OCBBF20012 OCBBF2001 4080. OCBBF20014940, PANCR1 000021 0, PEBLM20004790, 
PLACE50001050, PLACE500011 30, PLACE60003790, PLACE6001 281 0, PLACE6001 8860, PLACE60044540, 
PROST20031170, PROST20032320, PROST20033400, PROST20051210, PROST20064500, SKMUS1 0001 290, 

10 SKMUS10001770, SKMUS20000740, SKMUS20007240, SKMUS20008630, SKMUS20009330, SKMUS20011290, 
SKNSH1 0001 740, SKNSH20003470, SMINT1 00001 60, SPLEN20001340, ST0MA1 0001 860, STOMA20001210, 
STOMA20004820, SYNOV20016480, TEST1 10000700, TEST1 10001 380, TESTI20001540, TESTI20005910, 
TESTI20012690, TESTI20018270, TESTI20022560, TESTI20027070, TESTI20029120, TESTI200341 90, 
TESTI20034980, TESTI20040000, TESTI20042070, TESTI20042950, TESTI20047120, TESTI20049820, 

15 TESTI201 38320, TESTI201 40360, TESTI30000020, THYMU1 0000830, THYMU1 000491 0, THYMU200031 70, 
THYMU20003690, TRACH20000150, TRACH20004720, TRACH20004970, TRACH20009260, UTERU1 0000960 
[0072] The following 23 clones presumably belong to the category of cell division- and/or cell proliferation-related 
proteins, 

BGGI120001610, BRACE20027550, BRACE20076850, BRAWH20005030, BRAWH20005220, FEBRA20075660, 

20 HCASM20002140, HLUNG1 0000640, IMR3220009730, NT2NE20003840, NT2RI20006850, NT2RI20041 900, 
NT2RI2005B110, NT0NG1 0002460, NTONG20008780, SKMUS2001 6340, SKNMC20003220, SPLEN10001430, 
TEST1 10001 680, TESTI20001 840, TESTI20021 050, TESTI20035120, TESTI2005731 0 
[0073] The following 60 clones presumably belong to the category of cytoskeleton-related proteins. 
ADRGL1 0000020, BRACE20006980, BRACE2000B850, BRACE20027960, BRACE20074470, BRACE20076630, 

25 BRACE20078820, BRACE20093070, BRAWH20000480, BRAWH20066220, CTONG20019550, CTOIMG200281 60, 
CTONG20055530, DFNES20002680, FCBBF20005910, FEBRA20007720, FEBRA20008810, FEBRA20034290, 
FEBRA20043290, FEBRA20072000, HEART20004480, HEART20005200, HLUNG1 0001 100, HSYRA20006050, 
IMR3220007910, KIDNE20040840, KIDNE20052960, NT2RI2001 4090, NT2RI20032220, NT2RI2005851 0, 
NT2RI20090660. NT2RP70000690, NT2RP70004250, NT2RP70028750, NT2RP70042600, NT2RP70049250, 

30 NT2RP70074220, NTONG20009660, OCBBF20011760, OCBBF20015280, PEBLMI 0000680, PR0ST1 0001 670, 
PROST20033380, TESTI1 0000420, TESTI1 0000510, TESTI20003560, TESTI20004350, TESTI20006000, 
TESTI20006990, TESTI20008490, TESTI20008830, TESTI20011410, TESTI20015110, TESTI20016610, 
TESTI20020570, TESTI20024230, TESTI20031090, TESTI20031170, TESTI20039140, TESTI20078720 
[0074] The following 59 clones presumably belong to the category of nuclear proteins and/or RNA synthesis-related 

35 proteins. 

3NB6920002810, 3NB6920015280, BGGI120005440, BRACE1 0001 150, BRACE20024780, BRACE20027550, 
BRAWH20005030, BRAWH20014180, BRAWH20069890, CTONG20024180, FEBRA20001290, FEBRA20075660, 
HEART20003090, HLUNG1 0000640, HSYRA1 0001 680, HSYRA200051 00, IMR3220008630, IIVIR322001 21 80, 
MAMGL1 0001 780, NT2NE1 0001 850, NT2NE20002140, NT2NE20003B40, NT2NE2001 6660, NT2NE2005441 0, 

40 NT2RI20002820. NT2RI20006850, NT2RI2001 091 0, NT2RI20025540, NT2RI20041 900, NT2RI20053350, 
NT2RI20057230, NT2RI20060720, NT2RI20067030, NT2RI20068550, NT2RI20078840, NT2RI20087490, 
NT2RP70004770, NT2RP7001 3060, NT2RP70076430, NTONG20008780, PEBLMI 0000340, PLACE50000580, 
PLACE60003790, PROST20001 760, PROST20062600, SKMUS1 0000220, SKIVIUS2001 6340, SKNMC20003220, 
SPLEN10001430, SPLEN20001970, TESTI1 0001 680, TESTI20002530, TESTI20007840, TESTI20021 050, 

45 TESTI20029120, TESTI200351 20, TESTI20057310, TRACH20003930, TRACH20012890 

[0075] The following 24 clones presumably belong to the category of protein synthesis- and/or protein transport- 
related proteins. 

BRACE20078680, FEBRA2007551 0, IMR3220008380, KIDNE20005190, KIDNE20050420, MESAN20002910, 
NB9N41 0001 350, NT2NE20005360, NT2RI20032050, NT2RI20032220, NT2RP70000760, NT2RP70076430, 
50 NT2RP70093940, OCBBF20008240, PLACE50000580, PROST20000530, SKMUS20000740, SKMUS20008630, 
TESTI20007840, TESTI2001 51 20, TESTI2001 8690, TESTI2007B720, THYMU1 00055B0, UMVEN20001330 
[0076] The following 6 clones presumably belong to the category of cellular defense-related proteins, 
BRACE20014550, NT2RI2003751 0, NT2RI20053350, NT2RP70029060, NT2RP70062960, PLACE50001 700 
[0077] The following 1 9 clones presumably belong to the category of development and/or differentiation-related pre- 
ss teins, 

BGG1 120006930, CTONG20028200, FCBBF50002610, FEBRA20014920, FEBRA2001 7150, FEBRA20060920, 
MAMGL10001820, NESO PI 0000870, NHNPC1 0001 240, NT2R 120078790, NT2RP70008120, NT2RP7001 8560, 
NT2RP70045410, OCBBF20002770, SALGL1 0000650, SMINT1 0001 000, TEST1 10000550, TESTI20026760, 
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TESTI20078140 

[0078] The following 1 58 clones presumably belong to the category of DNA- and/or RNA-binding proteins. 
3NB6920002810, 3NB6920010220, 3NB6920015110, 3NB6920015570, ADRGL1 0000650, BGG 1 120006840, 
BGG1 120006930, BNGH41 0000800, BNGH420005320, BRACE20014550, BRACE2002091 0, BRACE20024090, 

5 BRACE2002478G, BRACE20071 740, BRAWH1 0001 640, BRAWH1 0001 680, BRAWH20000340, BRAWH20006330, 
BRAWH20009010, BRAWH20014180, BRAWH20069890, CTONG20025580, CTONG20028200, D3OST20001 840, 
FCBBF1 0005980, FCBBF2000951 0, FCBBF5000261 0, FEBRA20003970, FEBRA20003990, FEBRA20004540, 
FEBRA20008560, FEBRA20009720, FEBRA2001 71 50, FEBRA2001 7900, FEBRA20050140, FEBRA20064760, 
FEBRA20067360, FEBRA20069420, FEBRA20072aOO, HEART20003090, HLUNG1 0000760, HSYRA10001370, 

10 HSYRA20016310, IMR3210002420, IMR3220007420, IMR3220008630, KIDNE2000051 0, KIDNE20039940, 
KIDNE20061490, KIDNE20078110, NES0P1 0000870, NHNPC1 0000840, NHNPC10001240, NHNPC20002120, 
NT2NE20002590, NT2NE20003840, NT2NE20008090, NT2NE20016660, NT2NE20054410, NT2RI20003410, 
NT2RI20004210, NT2RI20006850, NT2RI2001 0830, NT2RI20010910, NT2RI20025410, NT2RI20025850, 
NT2RI20057230, NT2RI2006071 0, NT2RI20067350, NT2RI20071330, NT2RI20074390, NT2RI20078790, 

15 NT2RI2007B840, NT2RI200871 40, NT2R 120087490, NT2RI20090650, NT2RP60001 000, NT2RP60001270, 
NT2RP70002710, NT2RP70008120, NT2RP70013060, NT2RP7001 8560, NT2RP70024500, NT2RP70032030, 
NT2RP70042040, NT2RP7Q04541 0, NT2RP70046560, NT2RP700551 30, NT2RP70061 620, NT2RP70062960, 
NT2RP70064900, NT2RP70069860, NT2RP70075370, NT2RP70081 670, NT2RP70085570, NT2RP70087200, 
NT2RP70090i90, NTONG20003340, NTONG20008780, NTONG20015500, OCBBF2001 1 01 0, OCBBF2001 5860, 

20 PEBLM1 0000340, PEBLM20001120, PEBLM20002700, PEBLM20003080, PLAGE60002050, PLAGE60005550, 
PLACE60021510, PLACE600303B0, PROST20001 760, PROST20003250, PROST20018230, PROST20031170, 
PROST20062600, PROST20073170, SALGL1 0000650, SKMUS1 0000640, SKI\/IUS20014920, SKMUS20016340, 
SKNMC20000650, SKNMG20002240, SKNI\/IC20003220, SKNMC20003560, SMINT1 0001 000, SI\/IINT20005450, 
SPLEN1 0001 430, SPLEN20000200, SPLEN20000720, SPLEN20001970, SYNOV20010140, SYNOV20013740, 

25 SYNOV20014510, TEST1 10000550, TESTI20001200, TESTI20007070, TESTI20010490, TESTI20013450, 
TESTI20015560, TESTI20018150, TESTI20021050, TESTI2O02149 TESTI20026760, TESTI20027890, 
TESTI20030710, TESTI20033270, TESTI20034130, TESTI20035120, TESTI20053960, TESTI20074640, 
TESTI20074660, TESTI20078640, THYMU1 0004590, TRACH20000790, TRACH20002370, TRACH20009440, 
TRACH20012a90, UTERU10001600 

30 [0079] The following 63 clones presumably belong to the category of ATP- and/or GTP-binding proteins. 

3NB6920002810, BNGH41 0000390, BRACE20022020, BRACE20028120, BRACE20071 380, BRAWH20000480, 
BRAWH20006860, BRAWH20066220, CTONG20013200, DFNES20002680, FEBRA20043290, FEBRA200521 60, 
FEBRA20072000, FEBRA20075510, HHDPC20000550, HLUNG20001160, HSYRA1 0001 680, HSYRA200051 00, 
HSYRA20006050, KIDNE20040840, IVIAMGLI 0001 780, l\/IESAN20002910, NB9N41 0001 350, NT2NE20003690, 

35 NT2NE20005170, NT2NE20016660, NT2NE20055170, NT2RI20068550, NT2RI20073840, NT2RP70004250, 
NT2RP70011660, NT2RP70029060, NT2RP70036290, NT2RP70042600, NT2RP70046870, NT2RP70062960, 
NT2RP70081370, NT2RP70081440, NT2RP70093700, OCBBF20008240, OCBBF20015280, PEBLM20004790, 
PLACE50001700, PLACE60003790, PROST2001 8990, PROST20033400, SKMUS20008630, SMINT1 0000420, 
TESTI20001540, TESTI20003560, TESTI20005910, TESTi20006950, TESTI20006990, TESTI20008490, 

40 TESTI20015110, TESTI2001 661 0, TESTI20022560, TESTI20029120, TESTI20034980, TESTI20042290, 
TESTI20047120, TESTI20049820, TESTI2005731 0 

[0080] Among the clones other than the ones shown above, NT0NG1 0001 300 is a clone which was predicted to 
highly possibly belong to the category of secretory protein and/or membrane protein based on the result of domain 
search by Pfam. 
45 FEBRA20017060, NT2RI20066790, SMINT1 0000710 

[0081] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
glycoproteln-related protein based on the result of domain search by Pfam. 

BRAGE200B0970, BRACE20092120, BRAWH1 0001300, FEBRA20019890, KIDNE20031850, KIDNE20060140, 
MESAN20000920, NB9N41 0000470, NT2RI20071480, NT2RI20078910, NT2RP700B8550, NTONG20016120, 
50 0CBBF1 0000910, PROST20094830, SKNSH1 000301 0, SPLEN20002670, TESTI20031 960, TESTI20036250, 

TESTI20037810, TESTI20083870, TESTI201 77400 

[0082] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

3NB6920009120, 3NB6920014710, BRACE1 0001 660, BRACE20083850, BRAWH20004760, BRAWH20012030, 
55 CTONG20011390, CTONG20018200, FEBRA20007870, FEBRA20043250, HHDPC20003150, NT2RI1 0000270, 
NT2RI20036950, NT2RI20053680, NT2RI20072540, NT2RI20083360, NT2RP70030550, OCBBF2001 3070, 
OCBBF20015270, PLACE60046630, PROST1 0003430, PROST20067370, SKMUS1 0001 040, SKNMC20015960, 
TESTI20030050, TESTI20033540, TESTI20035890, TESTI20068720, TRACH2000411 0 
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[0083] The 29 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription-related protein based on the result of domain search by Pfam. 

BNGH41 0001 900, BRACE20080970, BRACE20092120, BRAWH20093600, FEBRA20003770, FEBRA20024290, 
HLUNG10000990, KIDNE20004030, MESAN20000920, NB9N420001040, NT2NE10000140, NT2NE20001 740, 

5 NT2Ri20050610, NT2Ri20055640, NT2R 120072540, NT2RI20074690, NT2RP60000860, NT2RP70036470, 
NT2RP70036800, NT2RP7007221 0, NT2RP70074060, NT2RP70084870, NT0NG1 0001 300, NT0NG1 0002640, 
NTONG20016120, 0GBBF1 0000910, OCBBF1 0001 190, OCBBF200071 90, SKMUS20001 1 70, SKMUS2001 6620, 
SKNIV1C20000970, SKNiVIC20015960, SYN0V1 0001 280, TESTI20002380, TESTI20006270, TESTI20013300, 
TESTi20031520, TESTi20036250, TESTI2003781 0, TESTi20064830, TESTI20083870, TRACH20006750, 

10 TRACH20016070 

[00B4] The 43 clones shown above are clones which were predicted to highly possibly belong to the category of 

enzyme and/or metabollsm-reiated protein based on the result of domain search by Pfam. 

NT2RI20064120 

[0085] The 1 cione shown above is a clone which was predicted to highly possibly belong to the category of ceil 
15 division and/or ceil proliferation-related protein based on the result of domain search by Pfam. 
BRACE20083800, KiDNE20004970 

[0086] The 2 clones shown above are clones which were predicted to highly possibly belong to the category of 
cytoskeieton-reiated protein based on the result of domain search by Pfam. 

3NB6920009120, 3NB692001 471 0, BRACE1 0001 660, BRACE20083850, BRAWH20004760, BRAWH20012030, 
20 BRAWH20064500, GTONG20011390, CTONG20018200, FEBRA20007870, FEBRA20043250, HCASiVI20003070, 
HHDPG20003150, NT2RI1 0000270, NT2RI20036950, NT2RI20053680, NT2R 120072540, NT2RI20083360, 
NT2RP70012310, NT2RP70030550, NT2RP70036470, OCBBF20013070, OCBBF20015270, PLACE60046630, 
PR0ST1 0003430, PROST20067370, SKMUS1 0001 040, SKNMC20000970, SKNMC20015960, TESTI20030050, 
TESTI20032280, TESTI20033540, TESTI20035890, TESTI20068720, TRACH2000411 0 
25 [0087] The 34 clones shown above are clones which were predicted to highly possibly belong to the category of 
DNA- and/or RNA-bindIng protein based on the result of domain search by Pfam. 
NT2RI20064120 

[0088] The 1 cione shown above is a cione which was predicted to highly possibly belong to the category of ATP- 
and/or GTP-binding proteins based on the result of domain search by Pfam, 
30 [0089] The 1 85 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
but have been predicted to have some functions based on homology search using theirfuli-length nucleotide sequences 
and motif search In their estimated ORFs. 

3NB6910001160, ASTRO20004170, BNGH41 0000030, BNGH41 0001 900, BRACE20005250, BRACE20014770, 
BRAGE20016730, BRACE20017370, BRACE2002431 0, BRACE20028960, BRACE20077840, BRACE20083850, 

35 BRAWH20003230, BRAWH20009440, BRAWH20076050, CTONG2001 8200, CTONG2002721 0, CTONG20064490, 
DFNES20004320, FCBBF1 0006870, FCBBF20002760, FCBBF20012110, FEBRA20000530, FEBRA20005360, 
FEBRA20007570, FEBRA20011330, FEBRA20019890, FEBRA20030540, FEBRA20043250, FEBRA20044900, 
FEBRA20048180, FEBRA20053800, FEBRA20068730, FEBRA200701 70, HCASM1 000021 0, HCASM20005360, 
HHDPG20001150, HHDPG200014g0, HLUNG1 0000990, HSYRA1 0001 1 90, HSYRA20001 350, HSYRA20006400, 

40 iMR3220002230, ilVIR322001491 0, KiDNE10001520, KIDNE20003750, KIDNE20004970, KIDNE20005740, 
KiDNE20031850, KIDNE20043440, KiDNE20056760, KIDNE20059080, KIDNE20060140, KIDNE20060300, 
KiDNE20067750, LIVER1 0000790, LIVER1 0004330, MESAN1 0001 01 0, MESAN20000920, NB9N41 0000470, 
NBgN420001040, NB9N420004950, NT2NE1 00001 80, NT2NE1 0000630, NT2NE20013370, NT2NE2001 6970, 
NT2NE20035690, NT2NE2005371 0, NT2RI20006690, NT2RI20013420, NT2Ri20013850, NT2RI20015190, 

45 NT2RI20016210, NT2Ri20022700, NT2RI200251 70, NT2RI20029260, NT2RI20029580, NT2RI20043040, 
NT2RI20061830, NT2Ri20064120, NT2RI20065060, NT2RI20074690, NT2RI20077230, NT2RI20082210, 
NT2RI20083960, NT2Ri20088120, NT2RP60000080, NT2RP60000350, NT2RP60000720, NT2RP60000860, 
NT2RP70009060, NT2RP70010800, NT2RP70022430, NT2RP70028290, NT2RP70033040, NT2RP70036320, 
NT2RP70036800, NT2RP70042330, NT2RP70049150, NT2RP70052050, NT2RP70055020, NT2RP70063040, 

50 NT2RP70072210, NT2RP70084410, NT2RP70084870, NTONG1 0000520, NTONG1 0001 230, NT0NG1 0001 300, 
OGBBF1 0001 220, OCBBF200071 90, OCBBF20011400 OCBBF20014020, OCBBF20014940, PEBLiV110001440, 
PEBLM20002130, PLACE50000370, PLAGE50000800, PLACE60014430, PLACE600241 90, PLAGE60033990, 
PLAGE60038500, PLAGE60043970, PLACE60044640, PROST20023380, PROST20034720, PROST20067370, 
PROST20079740, SALGL1 0000470, SKiVlUSI 00001 40, SKMUS1 0001 040, SKMUS1 0001 1 80, SKMUS20001170, 

55 SKiVlUS20003650, SKMUS20003900, SKMUS20004580, SKIVIUS20009020, SKiVlUS20009540, SKMUS2001 0080, 
SKMUS20011470, SKMUS20015430, SKMUS20016620, SKNMC20000970, SKNMG20015960, SiVIINTI 0001 030, 
SMINT20001450, SMINT20003960, SiVIINT20004000, SPLEN20002670, SYNOV1 0001 280, SYNOV20002910, 
SYNOV20008200, TEST1 10000250, TEST1 10000640, TESTI1 0001 310, TESTi10001910, TESTI20000440, 
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TESTI20002070, TESTI20002080, TESTI20014120, TESTI20016650, TESTI20022230, TESTI20022940, 
TESTI20024610, TESTI20027290, TESTI20030050, TESTI20030590, TESTI20030740, TESTI20035510, 
TESTI20035740, TESTI20041220, TESTI20052680, TESTI20054080, TESTI20064830, TESTI20065720, 
TESTI20068660, TESTI20071830, TESTI20078670, TESTI20083870, THYMU 10000020, THYMU1 0002910, 

5 TRACH1 0000300, TRACH20006750, TRACH20007800, TRACH20008940, TRACH20013950 

[0090] Further, the reason is that a polypeptide does not always beiong solely to a single category of the above- 
described functional categories, and therefore, a polypeptide may belong to any of the predicted functional categories. 
Besides, additional functions can be found for the clones classified into these functional categories by further analyses. 
[0091] Since the polypeptide encoded by clones of the invention contains full-length amino acid sequence, it is pos- 

10 sible to analyze its biological activity, and its effect on cellular conditions such as cell proliferation and differentiation 
by expressing the polypeptide as a recombinant polypeptide using an appropriate expression system, injecting the 
recombinant into the cell, or raising a specific antibody against the polypeptide. 

[0092] The biological activities of respective polypeptides can be analyzed by the methods as shown below. 

Secretory protein, transmembrane protein: 

15 

"Ion Channels" (Ed., R. H. Ashley, 1995) of "The Practical Approach Series" (IRL PRESS), 

"Growth Factors" (Eds., I. McKay, I. Leigh, 1993), 

"Extracellular Matrix" (Eds., M. A. Haralson, J. R. Hassell, 1995) ; 

20 Glycoprotein-related protein: 

"Glycoblology" (Eds., M. Ful<uda, A. Kobata, 1993) of 'The Practical Approach Series" (IRL PRESS), 
"Glycoprotein Analysis in Biomedicine" (Ed., Elizabeth F.Hounseil, 1993) of "IVIethod in Molecular Biology" (Hu- 
mana Press) series; 

25 

Signal transduction-related protein: 

"Signal Transduction" (Ed., G. Milligan, 1992) of "The Practical Approach Series" (IRL PRESS), 
"Protein Phosphorylation" (Ed., D. G. Hardie, 1993), or 
30 "Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J.Hill, 1995) of "l\^ethod in Molecular Biology" 

(Humana Press) series; 

Transcription-related protein: 

35 "Gene Transcription" (Eds., B. D. Hames, S. J. Higgins, 1993) of "The Practical Approach Series" (IRL PRESS), 

"Transcription Factors" (Ed., D.S.Latchman, 1993); 

Enzyme and/or metabolism-related protein: 

40 "Enzyme Assays" (Eds., ROBERT EiSENTHAL and MICHAEL J. DANSON, 1992) of "The Practical Approach 

Series ' (IRL PRESS); Cell division and/or cell proliferation-related protein: 

"Cell Growth, Differentiation and Senescence" (Ed., GEORGE STUDZINSKI, 2000) of 'The Practical Approach 
Series" (IRL PRESS) ; 

45 Cytosl<eleton-related protein: 

"Cytosl<eleton: Signalling and Cell Regulation" (Eds., KERMIT L. CARRAWAY and GAROLIE A. CAROTHERS 
CARRAWAY, 2000) of "The Practical Approach Series" (IRL PRESS), 

"Cytosl<eleton Methods and Protocols" (Ed., Gavin, Ray H., 2000) of "Method in Molecular Biology" (Humana 
50 Press) series; 

Nuclear protein and/or RNA synthesis-related protein: 

"Nuclear Receptors" (Ed., DIDIER PICARD, 1999) of "The Practical Approach Series" (IRL PRESS), 
55 "RNA Processing" (Eds., STEPHEN J. HIGGINS and B. DAVID HAMES, 1994); 

Protein synthesis and/or transport-related protein: 
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"Membrane Transport" (Ed., STEPHEN A. BALDWIN, 2000) of "The Practical Approach Series" (IRL PRESS), 
"Protein Synthesis IVIethods and Protocols" (Eds., Martin, Robin, 1998) of "Method in Molecular Biology" (Humana 
Press) series; 

5 Cellular defense-related protein: 

"DNA Repair Protocols" (Henderson, Daryl S., 1999) of "Method in Molecular Biology" (Humana Press) series, 
"Chaperonin Protocols" (Eds., Schneider, Christine, 2000); 

10 Development and/or differentiation-related protein: 

"Developmental Biology Protocols" (Eds., ROBERT EISENTHAL and MICHAEL J. DANSON, 1 992) of "Method in 
Molecular Biology" (Humana Press) series; 

15 DNA- and/or RNA-binding protein: 

"DNA-Protein Interactions Principles and Protocols" (Eds., Kneale, G. Geoff, 1994) of "Method in Molecular Biol- 
ogy" (Humana Press) series, 

"RNA-Protein Interaction Protocols" (Eds., Haynes, Susan R., 1999); 

20 

ATP- and/or GTP-binding protein: 

"Signal Transduction Protocols" (Eds., David A. Kendall, Stephen J. Hill, 1995) of "Method in Molecular Biology" 
(Humana Press) series. 

25 

[0093] In the categorization, the clone predicted to belong to the category of secretory and/or membrane protein 

means a clone having hit data with some annotation, such as growth factor, cytokine, hormone, signal; transmembrane, 
membrane, extracellular matrix, receptor G-proteIn coupled receptor, ionic channel, voltage-gated channel, calcium 
channel, cell adhesion, collagen, connective tissue, etc., suggesting that it was a secretory or membrane protein, or 
30 a clone in which the presence of nucleotide sequence encoding a signal sequence or transmembrane region was 
suggested by the results of PSORT and SOSUI analyses for deduced GRP. 

[0094] The clone predicted to belong to the category of glycoprotein-related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoprotein-related protein, 
[0095] The clone predicted to belong to the category of signal transductlon-related protein means a clone having hit 
35 data with some annotation, such asserine/threonine-protein kinase, tyrosine-protein kinase, SH3 domain, SH2 domain, 
etc, suggesting that the clone encodes a signal transductlon-related protein. 

[0096] The clone predicted to belong to the category of transcription-related protein means a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc., suggesting that the clone encodes 
a transcription-related protein. 

40 [0097] The clone predicted to belong to the category of disease-related protein means a clone having hit data with 
some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, UniGene, or nr, where 
the hit data corresponds to genes or polypeptides which have been deposited in the Online Mendelian Inheritance in 
Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is the human gene and disease database described later. 

45 [0098] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism-related protein. 

[0099] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 
50 apoptosis, etc., suggesting that the clone encodes a cell division and/or cell proliferation-related protein. 

[01 00] The clone predicted to belong to the category of cytoskeleton-related protein means a clone having hit data 
with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 
clone encodes a cytoskeleton-related protein. 

[01 01 ] The clone predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein means 
55 a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA helicase, 
polyadenylation, etc., suggesting that the clone encodes a nuclear protein and/or RNA synthesis-related protein. 
[0102] The clone predicted to belong to the category of protein synthesis and/or transport-related protein means a 
clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
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thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting that the clone encodes a protein 
synthesis and/or transport-related protein. 

[0103] The clone predicted to belong to the category of cellular defense-related protein means a clone having hit 
data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
5 cellular defense-related protein. 

[0104] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting that the clone encodes 
a development and/or differentiation-related protein. 

[0105] The clone predicted to belong to the category of DNA- and/or RNA-binding protein means a clone having hit 
10 data with some annotation, such as DNA-bindIng, RNA-binding, etc. 

[0106] The clone predicted to belong to the category of ATP- and/or GTP-blnding protein means a clone having hit 
data with some annotation, such as ATP-binding, GTP-blnding, etc. 

[0107] As to a protein involved in a disease. It is possible to perform a functional analysis as described above, but 
also possible to analyze correlation between the expression ortheactivity of the protein and a certain disease by using 

15 a specific antibody that is obtained by using expressed protein. Alternatively, it is possible to utiiize the database OMiM, 
which Is a database of human genes and diseases, to analyze the protein. Further, new Information Is constantly being 
deposited in the OMIM database. Therefore, it is possible for one skilled in the art to find a new relationship between 
a particular disease and a gene of the present invention in the most up-to-date database. The proteins involved in 
diseases are usefui for developing a diagnostic marker or medicines for regulation of their expression and activity, or 

20 as a target of gene therapy. 

[0108] Also, as for a secretory protein, membrane protein, signal transduction-related protein, glycoprotein-related 
protein, or transcription-related protein, etc., search of the OIVIIM with the following keywords resulted In the finding 
that the proteins are involved in many diseases (the result of the OMIM search for secrete and membrane proteins is 
shown below). Also, association between proteins related to signal transduction or transcription and diseases is re- 

25 ported in "Transcription Factor Research-1 999" (Fujii, Tamura, Morohashi, Kageyama, and Satake edit, (1 999) Jikken- 
Igaku Zoukan, Vol.17, No.3), and "Gene Medicine" (1999) Vol.3, No.2). When cancer is used as an example, as de- 
scribed in "Biology of Cancer" (S. Matsubara, 1992) of Life Science series (Shokabo), many proteins are involved in 
cancers, which include enzyme and/or metabolism-reiated proteins, cytoskeleton-reiated proteins, cell division and/or 
cell proliferation-related proteins as well as secretory proteins, membrane proteins, signal transduction-related proteins, 

30 glycoprotein-related proteins, transcription-related proteins. As clearly seen by the above example, it is evident that 
not only disease-related proteins but also secretory proteins, membrane proteins, signal transduction-related proteins, 
glycoprotein-related proteins, transcription-related proteins, etc. are often involved in diseases, and thus they can be 
useful targets in the field of medical industry. 

[0109] The result of the OMIM search for secretory and membrane proteins is shown below, in which the keywords, 

35 

(1) secretion protein, 

(2) membrane protein, 

(3) channel, and 

(4) extracellular matrix were used. 

[0110] Shown in the search result are only the accession numbers in the OMIM. Using the number, data showing 
the relationship between a disease and a gene or protein can be seen. The OMIM data has been renewed everyday. 

1) Secretion protein 

45 354 entries found, searching for "secretion protein" 

*604667, *104760, *176860, *151675, *139320, *107400, *604029, M18910, #200100, *176880, *603850, 
*1 47572, '604028, "179513, '125950, '139250, '246700, '600946, '600560, '602926, 1 85860, '605083, '603215, 
'602421, '157147, '179512, '600174, '109270, '604710, '138120, '179510, '600998, '179509, '170280, 
'179511, '600626, '603831 , '601489, '154545, '179490, 

50 '603826, '122559, '603216, '102720, '147290, '164160, '603062, '112262, '602672, '605435, '605322, 

'131230, '601652, '603166, '601746, '601591, '179508, #160900, '104311, '600759, '147545, '167805, 
#104300, '167770, #219700, '168470, '601684, '602049, '601146, '605227, '602434, '602534, '114840, 
'603489, '604323, '107470, '600753, '600768, '118825. '600564, 

'604252, '173120, '134370, '192340, '308230, '600322, '605359, '600046, '300090, 106160, '600041, 
55 #262500, '605563, '150390, '158106, '182590, #103580, '104610, #173900, '134797, '143890, #145980, 

'306900, '308700, '176300, '227500, '137350, #154700, '138079, '600760, '107730, '142410, '147670, 
'124092, '590050, '152760, '600509, '605646, '201910, '227600, 

'152790, '300200, '300300, 300800, '1381 60, '107741, '1201 50, '601 199, '1201 80, '1201 60, '176730, '1331 70, 
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*122560, *107300, *137241, *120140, MOIOOO, *193400, *217000, *272800, *600937, #201710, *600377, 
#174800, *106100, #274600, *1 73350, #177170, *1 47620, *214500, *131244, *202110, *120120, *601007, 
*191160, *1 47470, *603372, *600733, *252800, *190160, 

*138040, '158070, ♦162151, #125700, #130070, *113811, *603355, *171060, *136435, #184700, *603732, 
*190180, *1 64008, *1 86590, *1 20220, '604312, *1 52200, *138130, *605085, *605353, *600840, #166210, 
*188545, *207750, "173360, *601933, #194050, M53450, *138850, *253200, *307030, *157145, *600514, 
*600262, *264080, *1 47380, *600281 , #204000, #227810, *232200, 

*188826, *232800, *161561, #166200, *188400, M53620, *182099, *218040, #265800, *172400, #177200, 
*1 76805, #211600, #214700, #176410, *1 52780, *600633, *601771, *301500, *605402, *601922, *307800, 
*147892, '147720, *312060, #520000, *147660, *106150, *602358, *107270, *601769, *147440, *604558, 
*131530, *600270, *601610, *603692, *603401 , *600423, *601604, 

*603345, #125853, *602843, *142640, *603044, *605740, *134830, *602779, *130660, *139191, *137035, 
*600761, *601340, *600823, *107740, *130160, *600877, *605110, *600945, *130080, *600957, #130060, 
*605580, '118444, *601 1 24, *1 24020, 122470, '120700, *603201 , *137216, *6011 85, '138945, *21 8030, *600839, 
#240600, #262400, #162300, *162330, '188450, #265850, 

'263200, '162641, '300159, '601038, #191390, '201810, '601398, '602384, '131240, '602423, '139392, 
'142703, '602663, '232700, '602682, #602722, '602730, '600734, '188540, '182452, '601538, '603061, 
'146880, '603140, '603160, '142704, #252650, '182280, '125255, '603252, #131750, '182139, '182100, 
#259420, #261100, '603493, '601745, '182098, '603795, '123812, 

'600264, '147940, '180246, '180245, '118888, #604284, '168450, '118455, '604398, '604433, '601919, 
'118445, '600031, '6.04961, '605032, '605033, '171050, #171300, '131243, '109160, '605254, 274900, 
#171400, '600042, '151670, '184600, '605470, '605546, '176760, '602008, '102200, '605720, '600732, 
'605901 

2) Membrane protein 

1489 entries found, searching for "membrane protein" 

'130500, '605704, '305360, '153330, '173610, '109270, '170995, '170993, '104776, '602333, '309060, 
'605703, '120920, '605943, '602690, '159430, '600897, '133090, '601178, '602413, '602003, '604405, 
'605940, '603237, '109280, '600378, '602173, '107776, '602334, '602335, '125305, '601134, '309845, 
'605731, '154045, '603241, '603718, '600594, '603214, '185881, 

'603657, '600182, '603177, '605331, '601476, '605456, '601114, '605190, '600723, '603904, '136950, 
'300222, '602879, '185880, '605348, '300096, '602257, '177070, '310200, '603062, '603344, '600039, 
'602977, '300100, '128240, '600959, '600322, '227400, '186945, '600946, '602534, '602048, '182900, 
'601097, '600267, '602625, '136430, '602421 , '601047, '107450, 

'143450, '603141, '184756, '164730, '159440, '154050, '600579, '312080, '604202, '603700, '600447, 
'256540, '604691, '158343, '600403, '602414, '137290, '176640, '176981, '600179, '600754, '604456, 
'604693, '605875, '604605, '188860, '300172, '602910, '604323, '219800, '601848, '603179, '600279, 
'602251 , #222700, '603831 , '605072, '605377, '601028, '604155, 

'108733, '104225, '601896, '601510, '173335, '107770, '601767, '600046, '603850, '600040, '603784, 
'603234, 1 88560, '605863, '121015, '605862, '605861 , *1 86946, '604252, '60321 5, '1 42461 , '604597, '6031 43, 
'605264, '603735, '176860, '605536, '176801, '180721, '603355, '104760, '131560, '310300, '602631, 
'304700, #309400, '603142, '143890, '605431 , '600753, 

'115501, '176790, '600266, '601691, '168468, '601239, '602216, #104300, '605613, '601595, '605550, 
'125950, '605475, '602217, '602261, '603534, '602262, '604631, '190315; '601313, '604306, '104311, 
'604672, '605000, '602461, '605548, '602296, '604376, '121014, '121011, '600691, '604262, '139310, 
'304040, '605445, '179514, '179512, '151460, #160900, '120130, 

*1 28239, '601158, '601 403, *1 76943, '601 014, 300800, '300294, '601 757, '1 85470, '273800, '605034, '602887, 
#185000, '604871, '603593, '603583, '605454, '104775, '605872, '141180, '602713, '603531, '139150, 
'601531, '601832, '605452, '134651, '604156, '120620, '605883, '604142, '166945, '605324, '600816, 
'604699, '300112, '605182, '600164, '182180, '605071, 

'300023, '605057, '308240, '300249, '176947, '176894, '605081, '605035, '602044, '182860, '107271, 
'305100, '153390, '113730, '602689, '180069, '603518, '300017, '191275, '177061, '601693, '601789, 
'604241, '600934, '138160, '604424, '603868, '600174, '600718, '600523, '604141, '601009, '605251, 
'600481, '600874, '155550, '605227, '601017, '162230, 601138, 

'604157, '601212, '600763, '604110, '604158, '601107, '601326, 600621 , '600587, 601137, '600917, '600855, 
'605058, '194355, '605194, '603291, '102720, '136425, '170715, '603216, '605547, '135630, '602926, 
'600168, '605002, '602474, '600157, '603025, '603893, '231200, '120090, '601966, '131230, '604722, 
'604721, '604515, '246700, '602101 , '605628, '303630, 

'605787, '602857, '602285, '605708, '602488, '605025, '60381 7, '300051 , '603293, *1 76878, '603646, 605707, 
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185860, *1 12205, *300187, *602654, *1 20070, *603648, *604850, *602655, *602514, *300118, *1 82309, *1 79590, 
*602701, '600759, *204200, *604170, *175100, #103580, *1 47670, *306400, *143100, "182870, *257220, 
*1 80380, #116920, *301000, *1 93300, *157147, 

*131550, *139200, *139130, *190195, *605406, *155760, *155960, *605734, *155970, *605385, *111700, 
5 *155975, *1 50370, 605709, *151430 *605438 *1 51 510, *116952,*157655,*1581 05, *605777,*1 76877, *153619, 

*120131, M85430, *109190, "120190, *109170, *605093, *605250, *153432, *107777, *186590, M60993, 

*605699, *605698, *605813, *605697, *605616, *605300, 

*162060, *605219, *163970, '135620, '165040, '605478, '604964, '103195, '604932, '604923, '605906, 
*605496,*605914,*166490, 138277, '60491 5, *1 14070, '605213, '605933, *1 80297, *1 01 000, *1 911 63, *1 911 64, 
10 *605101, "603167, '600772, '603164, "600708, "604001, '191328, "313440, "602672, "604009, '604299, 

*192974, "604256, *603048, "600515, "604221 , "602632, 

"6041 96, "601 1 79, 603290, "604661 , "601 023, "601 1 1 0, "304800, "203200, "30021 2, "602933, "603352, "208900, 
"604418, "604838, "600551, #212140, "604837, "602049, "600552, "600553, "300213, "602574, "600583, 
"600932, "603452, "604775, "516020, "604617, "604464, '603498, "300145, "601523, "602694, '600632, 

15 "604762, "604492, '400015, '604504, '601717, '601728 

"300242, "602426, '604194, '603821, '604730, *600695, '603823, "603869, "300241, "600707, '603822, 
"602370, '602202, '604193, '601181, '604089, '602507, '604195, '602306, "300284, "601805, "601895, 
"601275, "604660, '600752, '603820, "604192, "602207, '308230, "600894, "312600, "603199, '604029, 
"602500, '102680, '235200, #256300, '601633, #219700, 262890, 

20 "156225, '173470, '193400, '173910, '600354, '113705, '600065, "107741, "107400, '600024, '131195, 

"113811, #118220, "601638, "300011, "276903, "604144, "311770, "601758, #173900, "604592, "120120, 
"179605, '603130, '603372, "110750, "222900, '602509, '256100, "602469, "602281, "229300, "224100, 
"110900, "190180, "261600, "602997, "603616, "603189, 601791, 

"601567, "312700, "171060, "308700, "604027, "162643, "516000, "176261, "604028, "314850, #145980, 
25 "601383, "600930, "305900, "601253, "136350, "605537, "138140, "604033, "605070, "139250, "300500, 

"603967, "300041, "603866, #130600, "120150, "601050, "604942, "605204, "605248, "272750, "600163, 
"604235, "600682, "107266, "306900, "191092, #262500, "600106, 

"152790, "186720, '227650, '153700, "308380, "103390, '605646, '164920, '604478, #252650, '173850, 
"173350, "602505, "246530, '194380, "602575, "603030, #209920, '212138, #214100, '605767, '600582, 
30 "189980, #176200, "604653, "604678, "256550, "300037, "253700, #253300, #226700, "604766, #244400, 

"190000, "188040, "604824, '214500, #237500, "232300, "605014, 

"604477, "190930, "605124, '604475, "604594, #227810, '306700, #301050, "600135, "600143, "605145, 
#269920, "300104, "277900, "300135, "300231, "192500, "182138, "191190, "176805, "600185, "186591, 
"604889, "603051, "165360, "147545, "601040, #156575, "107269, "603009, "602934, "123825, "601081, 

35 "602924, "163890, "600381 , "602909, "150330, "109690, "123900, 

"603434, "603491, "110700, "602581, "125647, #154700, "114760, "141900, "603690, "120220, "601199, 
#145500, "601309, "602382, "120325, "600877, "604205, "604090, "601497, "602377, "605464, "138720, 
"603728, '120950, '604026, '600580, "601610, '137167, '603960, "603931, "601880, "603126, '138190, 
"130130, '601997, "601975, '600395, "516040, '600418, '600650, 

40 "605245, "605172, '600509, "164761, "310400, "600308, '605109, "600544, "600359, "600103, "605267, 

"312610, '176100, "308100, '158070, '605123, "173325, #312750, "600839, "158120, #604369, "604465, 
"173510, #161200, "151525, "605369, "604237, "516050, #600886, "604517, "165180, "605381, "605399, 
"307800, "604365, "155740, "147795, 601709, '604673, "147730, 

"602122, "147557, "193245, "600978, "604990, '603261, "603274, "601007, "131100, "602941, "107941, 
45 "146710, "276901, "131244, "602872, "603411, "186357, '176290, "601066, "185050, '232200, "143030, 

"601843, #236700, "604122, "142800, "134638, "604985, "182380, "603930, "142410, "137060, "604586, 
"601193, '120650, "252500, "253800, "120930, "604858, "605874, 

601274, "602158, "605873, "193210, "203100, "601295, "604095, #201710, "126150, "108740, #205400, 
'601373, '300167, '109545, '602894, '603361, #300257, "266200, '603401, "131390, "180470, '605908, 

50 "604798, #221770, "223360, "180901, "605641, "605745, "604018, "300200, "604603, "230800, "602676, 

#604004, "605692, "602640, "601599, '134637, "245900, '118425, 

601 614, "605725, "1201 1 0, "3001 89, "300035, "6031 02, '250800, '602282, "602458, "12361 0, "603754, "300278, 
"601463, "300224, "601581, "182160, "601653, '139191, '601733, '600748, "142460, "601194, "152390, 
"153620, "601615, "601814, "601617, "601613, '300191, #308300, "600798, 601858, "601872, "601597, 
55 #601588, "600821 , "147840, "152427, "138850, "600823, 

"601 492, "300256, "600840, "300267, "601 41 1 , "1 39080, "1 39090, 600851 , "300334, "1 79080, "602095, "601 284, 
"601282, #177200, "601681, "601252, "176000, "602184, "602188, #266510, #154020, "186711, "257200, 
"601711, "600667, "602241, "186745, "255125, "300126, "600644, "123890, #255120, #175200, "600004, 
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*302060, *1 23580, *1 86760, *1 22561 , *602316, *600017, 

*1 20940, 1 40300, *1 51 690, *1 20700, *602354, *60001 9, *600857, *1 821 75, *600536, '1 58380, *60051 6, *1 20290, 
*600493, *182310, #252010, *182530, *186830, *601839, *142790, *159465, *118990, *250790, *248600, 
#248250, *186845, *601153, *142600, *116930, *114860, *171834, #303600, *186880, *600444, *142871, 

5 *601852, *602602, *602607, *1 14207, *1 86910, #232220^ 

600880, *1 34635, "112203, #112100, *11 1680, *231680, *311030, 250, *1 11200, *1 34390, #226670, #145600, 
*226200, *602714, *171760, *1 33550, *602727, *161555; *602744, *602746, #131705, *602835, *600423, 
*1 76267, *602859, #600918, 277175, '602874, *601020, *1 09770, *600170, *217070, M73515, *602893, 
*147280, *154360, *171050, M08780, *176257, *600979, 

10 *600377, *1 08360, *204500, *170260, *146880, *154582, *601011, *600997, *602992, *201475, *603005, 

*190198, *1 47360, #270400, *600238, #164970, *306250, #126600, *1 93065, #181350, *106180, *602136, 
*600937, *603086, *603087, *307030, *1 82099, *1 03320, *601683, #192430, *103180, *1 02681, *1 92321, 
*600244, *191740, *191315, *603152, *102642, *191305, #266140, 

*1 00500, *600867, *604585, *604404, *604345, *603201, *605430, *603207, *603208, *605433, *604101, 
15 *603969, *605B96, *604616, *605851, *605768, *604576, *605754, *605730, *605477, *603263, *605538, 

*603283, '604402, *605453, *605427, *603302, '605458, 60331 3, *604415, *603345, *605541 , *603353, *605295, 
*603879, *605268, *605266, *605246, '603377, *6033a0, 

'605181, '604203, '603425, '603867, '605106, '605017, '603842, '604936, '603510, '604857, '605932, 
'605816, '603765, '603551, '605357, '605237, '604204, '603594, '605110, '604190, '603861, '604962, 
20 '603639, '603644, '605007, '605349, '604943, '604918, '604907, '603667, '603681, '605396, '605561, 

'603712, '603713, '605688, '605942, '604878, '604843, '604659, 

'604671, '603798, '604682, '604056, '604705, '603749, 602586, '603647, '602515, #602475, '603717, 
*602359, '602372, '602380, '60251 8, '603652, '602573, '603626, 602587, '603598, '602871 , '60361 3, '603750, 
'603875, '602608, '602666, '602345, '602935, '603564, '603548, '603927, 601 876, '602343, '603943, '603787, 

25 '601730, '601611, '602679, '603788, '602243, 

603790, '601 535, '603796, '601 488, '601 485, '60231 4, '601 478, '604047, '604048, '602297, '604057, '60271 5, 
'602192, '601459, '601416, '603833, '602190, '604102, '602106, '604111, '602724, '603499, '602736, 
'601123, '601002, '600923, '601987, '604149, '601929, '600910, '600900, '600864, '604165, '600782, 
'602836, '600769, '600742, '602783, '601905, '600535, 

30 '604198, '601901, '600534, '602876, '603356, '600530, '604216, '604217, '602890, '602905, '600465, 

'600464, '600446, '602891, '603366, '601894, '604272, '603926, '603312, '600368, '602914, '600327, 
'603151, '603202, 602911, '602974, '603006, '601883, '603008, '600074, '603007, '603046, #603903, 
'604433, '600016, '603925, '516005, '516004, '516003, '601756, 

'604487, '516001, '313475, '313470, #307810, '604527, '604528, '601745, '604551, '604555, '603243, 
35 '603242, '603061, '603063, '603217, '300335, '300283, '300281, '604600, '300197, '603097, '603220, 

'601625, '604623, '603118, '601590, '604646, '300008, '601568, '300007, '275630, '601533, #275200, 
'270200, #261550, '604031 , '604683, #254800, '251100, #242300, 

'604058, '604720, '240500, '233690, #232240, #226730, '223100, '222100, #220100, '216950, '604832, 
212750, 212067, '604066, '193067, 601315, '193001 , '604862, '604870, '191306, '600385, '604879, '191191 , 
40 '601296, '604914, '190181, '604119, #188550, '604925, '188410, #601287, '604939, '188380, '604126, 

'604945, '604148, '188060, '604982, '186854, '604988, 

'1 86360, '1 86355, *1 85250, '60091 6, '605008, '605009, 1 85020, '600734, '605024, *1 82331 , '605032, '605033, 
'182305, '180903, #179800, '179610, '605060, '179410, '178990, '176802, '605080, '176266, '176263, 
'1 76260, '600732, *1 73490, '6041 99, *1 73445, '1 73391 , 1 72290, '6051 47, '6051 49, '1 71 890, '600528, '171 833, 

45 '6051 85, #1 70500, '6051 93, #1 68000, '6051 96, 

'167055, '605205, '605208, 166900, '605216, '162651, '162010, '600504, #161400, '604253, #160800, 
'159460, '154540, '605254, '605261, '153634, '600429, '153337, '600424, '605292, #604286, #152700, 
152423, '152310, '151625, '6001 53, '604313, '151523, '150325, '150320, '150292, '603150, '150290, '150210, 
'605410, '605415, '605416, '605417, '605421 , '603149, 

50 '604349, '147940, '600282, '147880, '146928, '146661, '600150, '146630, '142622, '600018, '605461, 

'138981, '138590, '600023, '138330, '605495, '138297, '605512, '138230, #136900, #301310, '516006, 
'605545, '605546, '136131, '134660, '134350, '516002, '605589, '131235, #130050, '605625, '126455, 
'126064, #125310, '605670, '604534, '125240, '123836, '123830, 

'123620, '605702, #122200, '120980, '120360, '118510, '114835, '605710, '605716, '605722, '114217, 
55 '604561, '113810, '111740, #110800, '605748, '605752, '604564, '110600, '603160, '109610, '605784, 

#107480, '107273, '603192, '300169, '106195, '105210, '104615, '104614, '104210, '103850, 103581, 
'605876, '605877, '605879, '103220, '605887, '300150, '102910, 
'102670, '102576, '605916, '604629, '102575, '102573, '300132, '101800, '605947 
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3) Channel (member of membrane protein) 
361 entries found, searching for "channei" 

*1 76266, *600724, *1 82390, *1 23825, *1 14208, *1 14206, *1 76267, *1 14205, *601784, *600937, *1 14204, 
*603415, ♦600053, *114207, *114209, *605427, *604527, *604528, *600760, *601011, *192500, *118425, 
5 *600228, *1 76261, *602235, *600761, *600359, *300008, *1 82389, *600877, *602232, *1 76263, *1 82391, 

*601328, *600054, *603939, *602208, "601534, '600504, '602323, 

*603208, '601958, '603537, '601012, '601327, '600734, '602780, '602781, '604433, '603220, '182392, 
'605874, '605873, '601745, '603888, '603219, '602604, '603796, '302910, *602866, '601013, '602905, 
'602906, '603967, '600163, #170500, '152427, '180901, '176260, #601462, *603951, *601141, '604492, 

10 '600702, '602023, '600308, '602754, '107776, '176257, '602024, 

'601949, '605222, '601142, '602983, '193245, '600681, '176265, *600235, *1 76262, '176258, '605206, 
'604427, '605411, '603305, '601219, '600150, '604065, '602343, '605223, '605720, '603906, '138249, 
'138253, '600B43, '604385, '600003, '600935, '603940, '602727, '602158, 602911 , '600397, '602726, '600845, 
'605080, '600580, '602872, '602106, '176264, '603953, 

15 '605722, '30011 0, *1 38252, '6041 11 , '60271 7, '602420, '600570, 600844, '603493, '600932, '60571 6, '138254, 

'603652, '300138, '605410, '176268, '605214, '605696, '300334, '604660, '176256, '605879, '603749, 
'603583, '602345, '604661 , '603787, 60331 3, '602982, '604337, '600846, '604662, '300328, '300281 , '602566, 
'602836, '604003, '603788, '603651 , '602421 , 

'107777, #177200, '100725, #219700, '100690, '100710, #160800, #603830, #183086, '600509, #220400, 
20 #601144, '173910, '180902, '605692, #264350, #160900, #145600, #255700, '602076, '603061, '601313, 

'154275, #604233, '604532, #108500, #121201, #170400, '300225, '121014, '139311, #125800, #160120, 
'118503, 601439, #141500, #168300, '304040, #601887, #256450, 

'186945, '154276, #300009, #216900, '600040, '601014, '601042, '602512, '601383, '605445, '602368, 
'603831, #117000, '601218, '108745, '605248, #177735, #173900, '601212, '182139, '601059, '600039, 
25 '601485, '180903, '186360, '603319, #600101, '118509, '600109, #121200, '600170, '604187, '176975, 

'137163, #310468, #263800, #262300, '603750, '600229, '124030, 

'602251, #603829, '137143, #145500, '600669, '147450, '154050, '603353, '600516, '601157, '600855, 
'601154, '602522, '249210, '600968, #252650, '171060, '600919, '156490, #259700, #601678, '601764, 
#310500, '131244, '300041, '121011, '125950, '114180, '602974, '600637, '113730, '118504, '605145, 

30 '604669, '118800, '121013, '121015, '138491, '600421 , '104610, 

'604045, '604594, '131230, '605487, '138247, '600467, #602485, '602481, '138251, '137192, '602403, 
600851, '277900, '603785, '603152, '603199, '603475, #168600, #272120, '170280, '603852, #241200, 
'603053, '600465, #603034, '142461, '164920, '137164, '600884, '600442, '123885, '604001, '600232, 
'232200, '171050, '602103, '602014, '300211 , '600983, '602887, 

35 *60441 5, '60441 8, '300242, #300071 , '604471 , '600837, 1 68350, *1 1 851 1 , 1 93007, '600300, '604654, #601 820, 

'180297, '600046, '603853, '604678, '604693, #604772, '118508, '603855, '605204, #254210, '182099, 
'182307, #130600, '601109, '114080, '300103, '182860, '605438, *601129, *603964, '600019, '516060, 
#185000, '138079, '104210, '605818, '603418, '305990, '305450 

4) Extracellular matrix 

40 218 entries found, searching for "extracellular matrix" 

'605912, '603479, '602201, '604633, '601418, '601548, '115437, '154870, '600754, '602261, '602285, 
'602262, '134797, '120361, '604629. '604871, '603321, '603320, '601807, #154700, '116935, '185261, 
'120360, '185250, '605470, '603767, '253700, '190180, '128239, '308700, '276901, '193300, '120324, 
'188826, '602109, '155760, '600514, '600261 , #1 77170, '600536, 

45 '147557, #116920, '150240, '601313, '120140, 601614, '605158, '120150, '120180, #200610, '605127, 

'193400, '192240, #173900, '152200, #136900, '135821, #130070, '120320, '120220, '112260, '310200, 
'600900, '600262, '605670, '600985, '179590, #245150, '602574, '601463, 183850, '601211, '604241, 
'600758, '186745, '604710, '602369, '602090, '190182, '192975, 

'602178, '230740, '600065, '601652, '158106, *190181, '156790, #158810, '193210, '155120, '192977, 
50 '193065, #226700, '187380, '231050, '182120, '188060, '186355, 163200, '164010, #156550, '151510, 

'150370, '253800, '156225, '150325, #194050, '150290, '216550, '147620, '600215, '222600, '147559, 
'165380, '182888, '600491 , '146650, '146640, '600564, '600596, 

'600616, '600700, '600742, '138297, '182889, '154705, '600930, '301870, '153619, '601050, '601090, 
'601105, '165070, '305370, '135820, '130660, '310300, '601492, '128240, '601587, #126600, '601636, 
55 '60011 9, '601 692, '601 728, '125485, 601 858, '601915, '602048, '175100, '6021 08, '121010, '600245, '120470, 

'120328, '120325, '602264, '120280, '602366, '600309, 

'602402, '602415, '602428, '602453, '602505, #166210, '602600, '602941, '603005, '603196, 603209, 
'603221, '603234, '603319, '120250, '120210, '120120, '603489, '603551, '118938, '603799, '603842, 



56 



EP 1 293 569 A2 



*603924, *603963, *604042, *604063, *604149, *604160, *601028, *604467, *604510, *604592, *116930, 
*116806, *601284, "604724, *604806, *604807, *604808, *107269, 

*605007, *605008, *605009, *600214, *600076, *605174, *605175, *605292, *605343, *605351, #600204, 

*605497, *605546, *605587, *605623, *600211 , *605702, *103320 

5 

[0111] In addition to these, the various l<eywords shown in the above-mentioned categorization or others can be 
used for the OiVliiVI search and the resuit may suggest the involvement thereof in diseases. 
[01 1 2] Further, the use of nucleotide sequences of cDNAs of the present invention enables analyzing the expression 
frequency of genes corresponding to the cDNAs. in addition, functions of the genes can be predicted based on the 

10 information obtained by the expression frequency analysis. 

[0113] There are several methods for analyzing the expression levels of genes Involved In diseases. Differences In 
gene expression levels between diseased and normal tissues are studied by the analytical methods using, for example. 
Northern hybridization, RT-PCR, DNA microarray, etc. (Experimental iVIedlclne, Vol.17, No. 8, 980-1056 (1999); Cell 
Engineering (additional volume) DNA IVIicroarray and Advanced PGR Methods, Muramatsu & Nawa (eds.), Shujunsya 

15 (2000)). By computer analysis, in addition to these analysis methods, the nucleotide sequences of expressed genes 
can be compared to analyze the expression frequency. For example, there is a database called "BODYMAP"; gene 
clones are extracted at random from cDNA libraries of various tissues and/or ceils, and the clones homologous to one 
another are assigned to a single cluster based on the information of nucleotide sequence homology at the 3'-end; 
genes are classified into any clusters, and the numbers of clones in the respective clusters are compared to gain the 

20 information on expression frequency (http;//bodymap.ims.u-tokyo.ac.Jp/). 

[0114] When explicit difference In the expression levels between diseased tissues and nonnal tissues is observed 
for a gene by these analytical methods. It can be conclude that the gene Is closely Involved In a disease or disorder 
Instead of diseased tissues, when gene expression Is explicitly different between normal cells and cells reproducing 
disease-associated specific features, it can be concluded that the gene Is closely Involved In a disease or disorder. 

25 [0115] From the 1639 clones whose full-length nucleotide sequences had been revealed, genes Involved in particular 
pathology or functions were selected by the use of databases shown below (see Example 7; "Expression frequency 
analysis in silico'). The database used In the analyses of the present Invention contains nucleotide sequences of 
770,546 clones, and the population of the database Is large enough for the analysis. The sequence infonnatlon in the 
database was obtained by selecting cDNA clones at random from cDNA libraries derived from the various tissues and 

30 cells shown In Example 1 and determining the 5'-end sequences thereof. 

[01 16] Then, the nucleotide sequences of respective clones In this database were categorized (clustered) based on 
the nucleotide sequence homology determined with a search program; the number of clones belonging to every cluster 
of each library was determined and normalized; thus, the ratio of a certain gene In a cDNA library was determined, 
This analysis provided the Information of the expression frequency of a gene In a tissue or cell that Is the source of the 

35 cDNA library. 

[0117] Then, in order to analyze the expression of genes corresponding to the nucleotide sequences of cDNAs of 
the present invention in tissues and cells, the libraries from the tissues or cells, which had been used in the large-scale 
cDNA analyses, were taken as subjects to compare the expression levels between different tissues or cells. Namely, 
the expression frequency was analyzed by comparing the previously normalized values between tissues or cells from 
40 which 600 or more cDNA clones whose nucleotide sequences had been analyzed were derived. The result of this 
analysis showed that the cDNA clones corresponded to the genes Involved in the pathology and functions, which are 
indicated below. Each value in Tables 3 to 39 indicated below represents a relative expression frequency; the higher 
the value, the higher the expression level. 

45 Osteoporosis-related genes 

[0118] Osteoporosis is a pathology In which bones are easily broken owing to overall decrease in components of 
bone. The onset correlates to the balance between the functions of osteoblast producing bone and osteoclast absorbing 
bone, namely bone metabolism. Thus, the genes involved In the Increase of osteoclasts differentiating from precursor 
50 cells of monocyte/macrophage line (Molecular Medicine 38. 642-648. (2001)) are genes involved In osteoporosis rel- 
evant to bone metabolism. 

[0119] A nucleotide sequence information-based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower In CD34+ cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 
entiation factor (Molecular Medicine 38. 642-648. (2001)) than In the untreated CD34+ ceil, which is the precursor ceil 
55 of monocyte/macrophage line. The result of comparative analysis for the frequency between the cDNA libraries pre- 
pared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ cells treated with the osteoclast differentiation 
factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the two were 
41 clones indicated in Table 3. These clones are involved in osteoporosis. 
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Genes involved in neural cell differentiation 

[0120] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
5 logical diseases. 

[0121] A survey was performed for genes whose expression levels are varied in response to Induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a, neural 
strain, NT2. The result of comparative analysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RIVI) and 
the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
10 levels were different between the two were 500 clones indicated in Table 4. These genes are neurological disease- 
related genes. 

Cancer-related genes 

15 [0122] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
thus the expression thereof can contribute to the carcinogenesis in tissues and cells. Thus, genes whose expression 
patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
for the genes whose expression levels in cancer tissues were different from those In normal tissues. 
[0123] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 

20 (BEAST) showed that the genes whose expression levels were different between the two were 1 1 clones Indicated in 
Table 5. 

[01 24] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
(CERVX) showed that the genes whose expression levels were different between the two were 1 0 clones indicated in 
Table 6. 

25 [0125] The result of comparative analysis of cDNA libraries derived from colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two were 5 clones indicated in 

Table 7. 

[0126] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (NESOP) showed that the genes whose expression levels were different between the two were 5 clones 

30 indicated in Table 8. 

[0127] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were 205 clones Indicated 

in Table 9, 

[01 28] The result of comparative analysis of cDNA libraries derived from llvertumor (TLIVE) and normal liver (LIVER) 
35 showed that the genes whose expression levels were different between the two were 35 clones indicated in Table 10. 
[0129] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and nonnal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were 62 clones Indicated in 
Table 11. 

[0130] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
40 (NOVER) showed the genes whose expression levels were different between the two were 7 clones Indicated in Table 
12. 

[0131] The result of comparative analysis of cDN A libraries derived from stomach tumor (TSTOM) and normal stom- 
ach (STOMA) showed thatthe genes whose expression levels were different between the two were 41 clones indicated 
in Table 13. 

45 [0132] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were 94 clones indicated in 
Table 14. 

[01 33] The result of comparative analysis of cDN A libraries derived from tongue cancer (CTONG) and normal tongue 
(NTONG) showed that the genes whose expression levels were different between the two were 1 78 clones indicated 

50 in Table 15. 

[0134] These genes are Involved in cancers, 

[0135] Further, there Is a method to search for genes Involved in development and differentiation, which is the ex- 
pression frequency analysis in which the expression levels of genes are compared between developing and/or differ- 
entiating tissues and/or cells and adult tissues and/or cells. The genes involved in tissue development and/or differ- 
55 entlation are genes participating in tissue construction and expression of function, and thus are useful genes, which 
are available for regenerative medicine aiming at convenient regeneration of injured tissues. 
[0136] By using the information of gene expression frequency gained from the database of 5'-end nucleotide se- 
quences described above, genes involved in development or differentiation of particular tissues were selected from 
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the 1639 clones whose full-length nucleotide sequence had been revealed (see Example 7). 
[01 37] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRA or OCBBF) and 
adult brain (BRACE, BRALZ, BRAIVIY BRAWH, BRCAN, BRCOC, BRHIP BRSSN, BRSTN or BRTHA) showed that 
the genes whose expression levels were different between the two were 745 clones indicated in Tables 1 6 to 36. 
5 [0138] The result of comparative analysis of cDNA libraries derived from fetal heart (FEHRT) and adult heart (HEART) 
showed that the genes whose expression levels were different between the two were 54 clones indicated in Tabie 37. 
[0139] The resuit of comparative analysis of cDNA libraries derived from fetal kidney (FEKID) and adult kidney 
(KiDNE) showed that the genes whose expression levels were different between the two were 145 clones indicated 
in Table 38. 

10 [0140] The result of comparative analysis of cDNA libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were 63 clones indicated in Table 39. 
These genes are Involved In regeneration of tissues and/or cells. 

[0141] The expression frequency or the like can be analyzed by PGR based on the nucleotide sequences of cDNAs 
of the present invention. There are some known methods for comparing the quantities of amplification products obtained 

15 by PGR. For exampie, the band intensities can be determined by ethidium bromide staining. With Ri-labeled or fluo- 
rescently labeled primers, the Rl signal or fluorescence intensity can be assayed for the quantity of labeled ampiification 
products. Alternatively, the quantity of amplification products can aiso be determined by measuring the Rl signal orthe 
fiuorescence intensity from the Ri-labeled orfluorescently iabeied probe hybridizing to the products. The assay resuits 
thus obtained are compared and then the clones exhibiting differences in the expression ievels can be selected. 

20 [0142] There are some quantitative PGR methods: a PGR method using internal standards; a competitive PGR, in 
which the quantification is achieved by adding, to a sample, a dilution series of a known quantity of a template RNA 
and by comparing the quantity of an amplification product derived from the RNA of Interest with the quantity of an 
amplification product derived from the template RNA. These methods overcome the problems of errors in the amount 
of amplification products among tubes and of the plateau effect. ATAG-PCR (Adaptor-tagged competitive PGR) Is a 

25 method of competitive PGR which is practiced by using multiple adapters of different sizes attached to a gene whose 
3'-end nucleotide sequence has previously been detemriined. The ratio of expression frequency of a single mRNA 
species from a number of tissues (cells) can be assayed in a single step (Nucleic Acids Research 1997, 25(22): 
4694-4696; "DNA Micro-array and Advanced PGR Techniques", Cell Technology, supplement, Eds., Muramatsu and 
Nawa (Shujunsha, 2000): 104-112), 

30 [0143] If It Is observed, by using these analytical methods, that the expression levels of genes are evidently varied 
during major cellular events (such as differentiation and apoptosis), the genes are involved in the ceilular events and 
accordlngiy are candidates for disease- and/or disorder-related genes. Further, genes exhibiting tissue-specific ex- 
pression are genes playing important parts in the tissue functions and, therefore, can be candidates for genes involved 
In diseases and/or disorders affecting the tissues. 

35 [01 44] For example, inflammation is an important biological response that is known to be involved in various diseases. 
The representative inflammation-inducing factors include TNF-a (Tumor Necrosis Factor-alpha), LPS (Lipopolysac- 
charides), etc. Many genes have been identified as genes located downstream of the TNF-a or LPS stimuiation. The 
respective stimulations are transduced through independent pathways of signaiing cascade. There exists another sig- 
naling cascade for both stimulations, wherein NF-kB is a common transducing molecule shared by the two stimulations 

40 (Cell 1995, 80:529-532). it has also been revealed that many inflammation-related genes, including IL-2, IL-6 and 
G-CSF, are varied in the expression levels thereof in response to the signal through the common pathway (Trends 
Genet. 1999, 15(6): 229-235). It is assumed that genes whose expression levels are varied in response to the stimu- 
lation of TNF-a or LPS also participate in inflammation. 

[0145] Further, the infection of Heiicobacter pyloriXo the gastric epithelia is known to cause gastritis and gastrodu- 
45 odenal ulcer (Mebio 2000, July 1 7(7): 1 6-33). Thus, the genes whose expression levels are altered depending on co- 
culturing cells with Helicobacter pylori ma^ be involved in gastritis and gastroduodenal ulcer A recent study has sug- 
gested that Helicobacter pylori strQng\y activates the NF-kB pathway, via the TRAF2/6-IKKp pathway, namely, via the 
same pathway shared by TNF-a (Gastroenterology 2000, 119: 97-108). 

[0146] THP-1 cell, which is a human monocyte cell line, was cultured in the presence of TNF-a (Tumor Necrosis 

50 Factor-alpha) or LPS (Lipopolysaccharides). The genes whose expression levels were altered owing to the presence 
of the agent were searched for, and the result showed that the ciones whose expression leveis were increased owing 
to the presence of TNF-a were ADRGL1 00001 80, BRACE20030780, BRACE20077640, BRACE20083850, 
BRAWH20004430, FCBBF1 00061 80, FEBRA20003780, FEBRA20006800, FEBRA20012940, FEBRA2001 5840, 
HEART20004480, HLUNG1 0000370, HLUNG20001 1 60, HSYRA2001 3320, IMR3220008380, KIDNE1 0001 520, 

55 KIDNE20040540, KIDNE20061490, KIDNE20062990, NT2N El 0001 630, NT2NE20003920, NT2NE20005500, 
NT2RI20014500, NT2RI2001 6570, NT2RI20078270, NT2RI20083360, NTONG1 0002570, PUAEN1 0003220, 
SKNMG1 0000290, STOMA20002570, TESTI20011340, UTERU20004850. 

[01 47] On the other hand, the clones whose expression levels were decreased owing to the presence of TN F-a were 
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BRACE20013400, BRACE20091880, HEART20005060, HLUNG20001760, IMR3220008590, NT2NE1 0001 850, 
NT2RI2001B660, NT2R 120053350, NT2RI20070480, PLACE600473B0, STOMA20002B90, SYNOV20001770, 
TRACH20001960. 

[0148] Further, the clones whose expression levels were increased owing to the presence of LPS were 
5 FCBBF1 00061 80, FEBRA20015840, HLUNG1 0000370, HLUNG20001160, HSYRA20013320, KIDNE20040540, 
KIDNE20061490, NT2NE1 0001 630, NT2NE20003920, NT2NE20005500, NT2RI2001 4500, NT2RI2001 6570, 
NT2RI20078270, NT0NG1 0002570, PUAEN1 0003220, STOMA20002570, TESTI20011340. On the other hand, the 
clones whose expression levels were decreased owing to the presence of LPS were BRACE20013400, 
BRACE20091880, HEART20005060, HLUNG20001 760, NT2Ri200704B0, UMVEN20001330. 
10 [0149] These clones are involved in inflammation. 

[0150] MKN45, which is a gastric cancer cell line, was co-cultured with Helicobacter pylori. The genes whose ex- 
pression levels were altered owing to the presence of Helicobacter pylori were searched for, and the result showed 
that the clones whose expression levels were increased owing to the presence of Helicobacter pylori were 
BRACE1 0001 590, BRACE20079530, BRAWH1 0001 620, FEBRA20006800, KIDNE20003490, KiDNE20040540, 
15 KiDNE20050420, NT2NE1 0001 850, STOMA20002890, SYNOV20001 770, TESTil 0000550, UTERU20004850. On 
the other hand, the clones whose expression levels were decreased owing to the presence of Helicobacter pylori were 
BRAGE20034490, BRACE20077640, BRACE20Q83850, KIDNE20005170, LiVER20O00330, NT2RP60000390, 
NT0NG1 0000980, Ui\/1VEN20001330. 

[0151] These clones are involved in gastritis or gastroduodenai ulcer 

20 [01 52] For example, if the polypeptide encoded by the cDNA of the present invention is a regulatory factor of cellular 
conditions such as growth and differentiation, it can be used for developing medicines as follows. The polypeptide or 
antibody provided by the Invention is injected into a certain kind of ceils by microinjection. Then, using the cells, it is 
possible to screen low molecular weight compounds, etc. by measuring the change in the cellular conditions, or the 
activation or inhibition of a particular gene. The screening can be performed as follows. 

25 [01 53] First, the polypeptide is expressed and purified as recombinant. The purified polypeptide is microinjected into 
cells such as various cell lines, or primary culture cells, and the cellular change such as growth and differentiation can 
be examined. Alternatively, the induction of genes whose expression is l<nown to be involved in a particular change of 
ceiiuiar conditions may be detected by the amount of mRNA or polypeptide. Alternatively, the amount of intracellular 
molecules (low molecular weight compounds, etc.) that is changed by the function of the gene product (polypeptide) 

30 which Is known to be involved in a particular change of cellular conditions may be detected. The compounds to be 
screened (both low and high molecular compounds are acceptable) can be added to the culture media and assessed 
for their activity by measuring the change of the cellular conditions. 

[0154] Instead of microinjection, cell lines introduced with the gene obtained in the invention can be used for the 
screening. If the gene product Is turn out to be Involved in a particular change In the cellular conditions, the change of 
35 the product can be used as a measurement for screening. Once a compound is screened out which can activate or 
inhibit the function of the polypeptide of the invention, it can be applied for developing medicines. 
[0155] If the polypeptide encoded by thecDNA of the present invention is a secretory protein, membrane protein, or 
protein involved in signal transduction, glycoprotein, transcription, or diseases. It can be used in functional assays for 
developing medicines. 

40 [0156] in case of a membrane protein, it is most likely to be a polypeptide that functions as a receptor or ligand on 
the cell surface. Therefore, It is possible to reveal a new relationship between a ligand and receptor by screening the 
membrane protein of the invention based on the binding activity with the known ligand or receptor. Screening can be 
performed according to the known methods. 

[0157] For example, a ligand against the polypeptide of the invention can be screened in the following manner 
45 Namely, a ligand that binds to a specific polypeptide can be screened by a method comprising the steps of: (a) contacting 
a test sample with the polypeptide of the invention or a partial peptide thereof, or cells expressing these, and (b) 
selecting a test sample that binds to said polypeptide, said partial peptide, or said cells. 

[0158] On the other hand, for example, screening using cells expressing the polypeptide of the present invention 
that is a receptor protein can also be performed as follows. It is possible to screen receptors that is capable of binding 

50 to a specific polypeptide by using procedures (a) attaching the sample cells to the polypeptide of the invention or Its 
partial peptide, and (b) selecting cells that can bind to the said polypeptide or its partial peptide. 
[0159] In a following screening as an example, first the polypeptide of the invention is expressed, and the recombinant 
polypeptide is purified. Next, the purified polypeptide is labeled, binding assay is performed using a various cell lines 
or primary cultured cells, and cells that are expressing a receptor are selected (Growth and differentiation factors and 

55 their receptors, Shin-Seikagaku Jikken Kouza Vol.7 (1991) Honjyo, Arai, Taniguchi, and IVIuramatsu edit, p203-236, 
Tokyo-Kagaku-Doujin). A polypeptide of the invention can be labeled with Rl such as '^^^l, and enzyme (alkaline phos- 
phatase etc.). 

[01 60] Alternatively, a polypeptide of the invention may be used without labeling and then detected by using a labeled 
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antibody against the polypeptide. The cells that are selected by the above screening methods, which express a receptor 
of the polypeptide of the invention, can be used for the further screening of an agonists or antagonists of the said 
receptor. 

[0161] Once the ligand binding to the polypeptide of the invention, the receptor of the polypeptide of the invention 
5 or the cells expressing the receptor are obtained by screening, it is possible to screen a compound that binds to the 
ligand and receptor. Also it is possible to screen a compound that can inhibit both bindings (agonists or antagonists of 

the receptor, for example) by utilizing the binding activities. 

[01 62] When the polypeptide of the invention Is a receptor, the screening method comprises the steps of (a) contacting 
the polypeptide of the invention or cells expressing the polypeptide of the invention with the ligand, in the presence of 

10 a test sample, (b) detecting the binding activity between said polypeptide or cells expressing said polypeptide and the 
ligand, and (c) selecting a compound that reduces said binding activity when compared to the activity in the absence 
of the test sample. Furthennore, when the polypeptide of the invention Is a ligand, the screening method comprises 
the steps of (a) contacting the polypeptide of the invention with its receptor or cells expressing the receptor in the 
presence of samples, (b) detecting the binding activity between the polypeptide and its receptor or the cells expressing 

15 the receptor, and (c) selecting a compound that can potentially reduce the binding activity compared to the activity in 
the absence of the sample. 

[0163] Samples to screen include cell extracts, expressed products from a gene library, synthesized low molecular 
compound, synthesized peptide, and natural compounds, for example, but are not construed to be listed here. A com- 
pound that is isolated by the above screening using a binding activity of the polypeptide of the Invention can also be 
20 used as a sample. 

[0164] A compound Isolated by the screening may be a candidate to be an agonist or an antagonist of the receptor 
of the polypeptide. By utilizing an assay that monitors a change In the Intracellular signaling such as phosphorylation 
which results from reduction of the binding between the polypeptide and Its receptor. It is possible to Identify whether 
the obtained compound is an agonist or antagonist of the receptor. Also, the compound may be a candidate of a 
25 molecule that can inhibit the Interaction between the polypeptide and its associated proteins (Including a receptor) in 
vivo. Such compounds can be used for developing drugs for precaution or cures of a disease in which the polypeptide 
is involved. 

[0165] Secretory proteins may regulate cellular conditions such as growth and differentiation. It Is possible to find 
out a novel factor that regulates cellular conditions by adding the secretory protein of the Invention to a certain kind of 
30 cell, and performing a screening by utilizing the cellular changes in growth or differentiation, or activation of a particular 
gene. 

[0166] The screening can be performed, for example, as follows. First, the polypeptide of the invention is expressed 
and purified in a recombinant form. Then, the purified polypeptide is added to a various kind of cell lines or primary 
cultured cells, and the change In the cell growth and differentiation Is monitored. The Induction of a particular gene 

35 that is known to be involved in a certain cellular change is detected by the amounts of mRNA and polypeptide. Alter- 
natively, the amount of an Intracellular molecule (low-molecular-welght compounds, etc.) that Is changed by the function 
of a gene product (polypeptide) that is known to function in a certain cellular change is used for the detection. 
[0167] Once the screening reveals that the polypeptide of the invention can regulate cellular conditions or the func- 
tions, it is possible to apply the polypeptide as a pharmaceutical and diagnostic medicine for related diseases by itself 

40 or by altering a part of it into an appropriate composition. 

[0168] As is above described for membrane proteins, the secretory protein provided by the invention may be used 
to explore a novel ligand-receptor interaction using a screening based on the binding activity to a known ligand or 
receptor A similar method can be used to identify an agonist or antagonist. The resulting compounds obtained by the 
methods can be a candidate of a compound that can inhibit the interaction between the polypeptide of the Invention 

45 and an interacting molecule (including a receptor). The compounds may be able to use as a preventive, therapeutic, 
and diagnostic medicine for the diseases, in which the polypeptide may play a certain role. 

[0169] Proteins involved in signal transduction or transcription may be a factor that affects a certain polypeptide or 
gene In response to intracellular/extracellular stimuli. It Is possible to find out a novel factorthat can affect a polypeptide 
or gene by expressing the polypeptide provided by the invention in a certain types of cells, and performing a screening 

50 utilizing the activation of a certain intracellular polypeptide or gene. 

[0170] The screening may be performed as follows. First, a transfonned cell line expressing the polypeptide is ob- 
tained. Then, the transformed cell line and the untransformed original cell line are compared for the changes in the 
expression of a certain gene by detecting the amount of its mRNA or polypeptide. Alternatively, the amount of an 
intracellular molecule (low molecular weight compounds, etc) that is changed by the function of a certain gene product 

55 (polypeptide) may be used for the detection. Furthermore, the change of the expression of a certain gene can be 
detected by introducing a fusion gene that comprises a regulatory region of the gene and a marker gene (luciferase, 
p-galactosidase, etc.) into a cell, expressing the polypeptide provided by the invention into the cell, and estimating the 
activity of a marker gene product (polypeptide). 
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[01 71] If the polypeptide or gene of the invention is involved in diseases, it is possible to screen a gene or compound 
that can regulate its expression and/or activity either directly or indirectly by utilizing the polypeptide of the present 
invention. 

[0172] For example, the polypeptide of the invention is expressed and purified as a recombinant polypeptide. Then, 

5 the polypeptide or gene that interacts with the polypeptide of the Invention is purified, and screened based on the 
binding. Alternatively, the screening can be performed by adding with a compound of a candidate of the inhibitor added 
in advance and monitoring the change of binding activity, in another method, a transcription regulatory region locating 
in the 5'-upstream of the gene encoding the polypeptide of the invention that is capable of regulating the expression 
of other genes is obtained, and fused with a marl<er gene. The fusion is introduced into a cell, and the cell is added 

10 with compounds to explore a regulatory factor of the expression of the said gene. 

[0173] The compound obtained by the screening can be used for developing phamnaceutical and diagnostic medi- 
cines for the diseases in which the polypeptide of the present invention is involved. Similarly, if the regulatory factor 
obtained in the screening is turn out to be a polypeptide, compounds that can newly affect the expression or activity 
of the polypeptide may be used as a medicine for the diseases in which the polypeptide of the invention is Involved. 

15 [0174] If the polypeptide of the invention has an enzymatic activity, regardless as to whether it is a secretory protein, 
membrane protein, or proteins Involved in signal transduction, glycoprotein, transcription, or diseases, a screening 
may be performed by adding a compound to the polypeptide of the invention and monitoring the change of the com- 
pound. The enzymatic activity may also be utilized to screen a compound that can inhibit the activity of the polypeptide. 
[0175] In a screening given as an example, the polypeptide of the invention is expressed and the recombinant 

20 polypeptide is purified, Then, compounds are contacted with the purified polypeptide, and the amount of the compound 
and the reaction products is examined. Alternatively, compounds that are candidates of an inhibitor are pretreated, 
then a compound (substrate) that can react with the purified polypeptide is added, and the amount of the substrate 
and the reaction products is examined. 

[0176] The compounds obtained in the screening may be used as a medicine for diseases in which the polypeptide 
25 of the invention is involved. Also they can be applied for tests that examine whether the polypeptide of the invention 
functions normally in vivo. 

[0177] Whether the secretory protein, membrane protein, signal transduction-related protein, glycoprotein-related 
protein, or transcription-related protein of the present invention is a novel protein involved in diseases or not is deter- 
mined in another method than described above, by obtaining a specific antibodyagainstthepolypeptideof the invention, 
30 and examining the relationship between the expression or activity of the polypeptide and a certain disease. In an 
alternative way, it may be analyzed refen-ed to the methods in "Molecular Diagnosis of Genetic Diseases" (Elles R. 
edit, (1996) in the series of "Method in Molecular Biology" (Humana Press). 

[01 78] Proteins involved in diseases are targets of screening as mentioned, and thus are very useful in developing 
drugs which regulate their expression and activity. Also, the proteins are useful in the medicinal industry as a diagnostic 

35 marl<er of the related disease or a target of gene therapy. 

[0179] Compounds isolated as mentioned above can be administered patients as it is, or after fonnulated into a 
pharmaceutical composition according to the known methods. For example, a pharmaceutically acceptable carrier or 
vehicle, specifically sterilized water, saline, plant oil, emulsifier, or suspending agent can be mixed with the compounds 
appropriately. The pharmaceutical compositions can be administered to patients by a method known to those skilled 

40 in the art, such as intraarterial, intravenous, or subcutaneous injections. The dosage may vary depending on the weight 
or age of a patient, or the method of administration, but those skilled in the art can choose an appropriate dosage 
properly. If the compound is encoded by polynucleotide, the polynucleotide can be cloned into a vector for gene therapy, 
and used for gene therapy. The dosage of the polynucleotide and the method of its administration may vary depending 
on the weight or age of a patient, or the symptoms, but those skilled in the art can choose properly 

45 [01 80] The present invention further relates to databases comprising at least a sequence of polynucleotide and/or 
polypeptide, or a medium recorded in such databases, selected from the sequence data of the nucleotide and/or the 
amino acids indicated in Table 1 . The tenn "database" means a set of accumulated information as machine-searchable 
and readable information of nucleotide sequence. The databases of the present invention comprise at least one of the 
novel nucleotide sequences of polynucleotides provided by the present invention. The databases of the present inven- 

50 tion can consist of only the sequence data of the novel polynucleotides provided by the present invention or can com- 
prise other Information on nucleotide sequences of known full-length cDNAs or ESTs. The databases of the present 
invention can be comprised of not only the information on the nucleotide sequences but also the information on the 
gene functions revealed by the present Invention. Additional infonnation such as names of DNA clones carrying the 
full-length cDNAs can be recorded or linked together with the sequence data in the databases. 

55 [0181] The database of the present invention is useful for gaining complete gene sequence infonnation from partial 
sequence information of a gene of interest. The database of the present invention comprises nucleotide sequence 
infonnation of full-length cDNAs. Consequently, by comparing the infonnation in this database with the nucleotide 
sequence of a partial gene fragment yielded by differential display method or subtraction method, the information on 
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the full-length nucleotide sequence of interest can be gained from the sequence of the partial fragment as a starting clue. 
[0182] The sequence information of the full-length cDNAs constituting the database of the present invention contains 
not only the information on the complete sequences but also extra infomiation on expression frequency of the genes 
as well as homology of the genes to known genes and known polypeptides. Thus the extra Information facilitates rapid 
5 functional analyses of partial gene fragments. Further, the information on human genes is accumulated in the database 
of the present invention, and therefore, the database is useful for isolating a human homologue of a gene originating 
from other species. The human homologue can be isolated based on the nucleotide sequence of the gene from the 
original species. 

[0183] At present, information on a wide variety of gene fragments can be obtained by differential display method 
10 and subtraction method. In general, these gene fragments are utilized as tools for isolating the full-length sequences 
thereof. When the gene fragment corresponds to an already-known gene, the full-length sequence is easily obtained 
by comparing the partial sequence with the information in known databases. However, when there exists no information 
corresponding to the partial sequence of interest in the known databases, cDNA cloning should be carried out for the 
full-length cDNA. It is often difficult to obtain the full-length nucleotide sequence using the partial sequence information 
15 as an initial clue. If the full-length of the gene is not available, the amino acid sequence of the polypeptide encoded by 
the gene remains unidentified. Thus the database of the present invention can contribute to the identification of full- 
length cDNAs corresponding to gene fragments, which cannot be revealed by using databases of known genes. 
[0184] The present invention has provided 1639 polynucleotides. As has not yet proceeded the isolation of full-length 
cDNA within the human, the invention has great significance. It is known that secretory proteins, membrane proteins, 
20 signal transduction-related proteins, glycoprotein-related proteins, transcription-related proteins, and so on are involved 
in many diseases. The genes and proteins involved in diseases are useful for developing a diagnostic marker or med- 
icines for regulation of their expression and activity, or as a target of gene therapy. 

[0185] in particular, cDNA assumed to encode secretory proteins, which were provided by this invention, are very 
important for the industry since the encoded proteins themselves are expected to be useful as pharmaceutical agents 
25 and many disease-related genes may be included in them. In addition, membrane proteins, signal transduction-related 
proteins, transcription-related proteins, disease-related proteins, and genes encoding them can be used as indicators 
for diseases, etc. These cDNA are also very important for the industry, which are expected to regulate the activity or 
expression of the encoded protein to treat diseases, etc. 

[0186] Any patents, patent applications, and publications cited herein are incorporated by reference. 
30 [0187] The invention is illustrated more specifically with reference to the following examples, but Is nottobeconstrued 
as being limited thereto. 

EXAMPLE 1 

35 Preparation of cDNA library by oligo-capping 

(1) Extraction and purchase of mRNA 

[0188] Total RNAs as mRNA sources were extracted from human tissues (shown below) by the method as described 
40 in the reference (J, Sambrook, E. F. Fritsch & T. IVIaniatis, Molecular Cloning Second edition, Cold Spring harbor Lab- 
oratory Press, 1989). Further, by the method as described in the reference (J. Sambrook, E. F Fritsch & T. Maniatis, 
Molecular Cloning Second edition. Cold Spring harbor Laboratory Press, 1989), total RNAs as mRNA sources were 
extracted from human culture cells and human primary culture cells (shown below) which had been cultivated by the 
methods described in the catalogs. 
45 [0189] The library names and the origins are indicated below in the order of "Library name: Origin". When a library 
was prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

<Extraction of mRNA from human tissues> 

50 NTONG: Normal tongue; 

CTONG: Tongue cancer; 
FCBBF: Fetal brain; 
OCBBF: Fetal brain; 
PLACE: Placenta; 

55 SYNOV: Synovial membrane tissue (from rheumatioid arthritis). 



<Extraction of mRNA from culture cells> 



EP 1 293 569 A2 



BNGH4: H4 cells (ATCC #HTB-148) ; 
IMR32: IMR32 cells (ATCC #CCL-127); 
SKNMC: SK-N-MC cells (ATCC #HTB-10) ; 
3NB69: NB69 cells (RCB #RCB0480) ; 
5 BGGI1 : GI1 cells (RCB #RCB0763); 

NB9N4: NB9 cells (RCB #RCB0477) ; 
SKNSH: SK-N-SH cells (RCB #RCB0426) ; 
NT2RIVI: NT2 cells (STARATAGENE #204101); 

NT2RP: NT2 cells treated with retinolc acid (RA) for 5 weeks to induce the differentiation; 
10 NT2RI: NT2 cells treated with RA for 5 weeks to induce the differentiation, followed by the treatment with the 

growth inhibitor for 2 weeks; 

NT2NE: NT2 cells were treated with RA and the growth inhibitor for the neuronal differentiation, and the re- 
sultant neurons were concentrated and harvested (NT2 Neuron) ; 

NTISIVI: NT2 cells (STARATAGENE #204101) were treated with RA for 5 weeks to induce the differentiation, 
15 and then treated with the growth inhibitor for 2 weeks; mRNA was prepared from the cells and a cDNA library 

was constructed from the mRNA; the cDNAs of the library whose nucleotide sequences were shared by those 
of mRNAs from undifferentiated NT2 cells were subtracted by using a Subtract Kit (Invitrogen #K4320-01); 
the subtracted library (NT2RI-NT2RM) was provided by this procedure. 

20 RCB Indicates that the cell was provided by the Cell Bank, RIKEN GENE BANK, The Institute of Physical 

and Chemical Research; ATCC indicates that the cell was provided by American Type Culture Collection. 
<Extraction of mRNA from primary culture cells> 

ASTRO: Normal human astrocyte NHA5732, Takara Shuzo #CC2565; 
25 DFNES: Nonnal human denxial fibroblast (neonatal skin); NHDF-Neo NHDF2564, Takara Shuzo #CC2509; 

MESAN: Normal human mesangial cell NHMC56046-2, Takara Shuzo #CC2559; 

NHNPC: Normal human neural progenitor cell NHNP5958, Takara Shuzo #CC2599; 

PEBLM: Normal human peripheral blood mononuclear cell HPBMC5939, Takara Shuzo #CC2702; 

HSYRA: Human synoviocyte HS-RA (from rheumatioid arthritis), Toyobo #T404K-05; 
30 PUAEN: Normal human pulmonary artery endothelial cells, Toyobo #T302K-05; 

UMVEN: Normal human umbilical vein endothelial cell HUVEC, Toyobo #T200K-05; 

HCASM: Normal human coronary artery smooth muscle cell HCASMC, Toyobo #T305K-05; 

HCHON; Normal human chondrocyte HC, Toyobo #T402K-05; 

HHDPC: Normal human demial papilla cell HDPC, Toyobo #THPCK-001 ; 
35 CD34C: CD34+ cells (AllCells, LLC #CB14435M) ; 

D30ST: CD34+ cells treated with the osteoclast differentiation factor (ODF) for 3 days to induce the differen- 
tiation; 

D60ST: CD34+ cells treated with ODF for 6 days to induce the differentiation; 
D90ST: CD34+ cells treated with ODF for 9 days to induce the differentiation. 

[0190] Then, total RNAs extracted from the following human tissues were purchased and used as mRNA sources, 
The library names and the origins are indicated below in the order of "Library name: Origin". When a library was 
prepared by the subtraction method, the item is followed by a description of how to prepare the subtracted library. 

45 <Purchase of total RNA containing mRNA extracted from human tissues> 

ADRGL: Adrenal gland, CLONTECH #64016-1 ; 
BRACE: Brain (cerebellum), CLONTECH #64035-1 ; 
BRAWH: Whole brain, CLONTECH #64020-1 ; 

50 FEBRA: Fetal brain, CLONTECH #64019-1 ; 

FELIV: Fetal liver, CLONTECH #64018-1; 

HEART Heart, CLONTECH #64025-1; 

HLUNG: Lung, CLONTECH #64023-1 ; 

KIDNE: Kidney CLONTECH #64030-1; 
55 LIVER: Liver, CLONTECH #64022-1 ; 

IVIAMGL: Mammary Gland, CLONTECH #64037-1 ; 

PANCR: Pancreas, CLONTECH #64031-1; 

PROST Prostate, CLONTECH #64038-1 ; 
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SALGL: Salivary Gland, CLONTECH #64026-1 ; 

SKMUS; Skeletal Muscle, CLONTECH #64033-1; 

SMINT: Small Intestine, CLONTECH #64039-1 ; 

SPLEN: Spleen, CLONTECH #64034-1; 
5 STOMA: Stomacfi, CLONTECH #64090-1 ; 

TBAES: Breast (Tumor), CLONTECH #64015-1; 

TCERX: Cen/ix (Tumor), CLONTECH #64010-1; 

TCOLN: Colon (Tumor), CLONTECH #64014-1; 

TESTI: Testis, CLONTECH #64027-1 ; 
10 THYMU; Tlnymus, CLONTECH #64028-1 ; 

TLUNG: Lung (Tumor), CLONTECH #64013-1; 

TOVAR: Ovary (Tumor), CLONTECH #64011-1 ; 

TRACH: Trachea, CLONTECH #64091-1 ; 

TUTER: Uterus (Tumor), CLONTECH #64008-1; 
15 UTERU: Uterus, CLONTECH #64029-1 ; 

ADIPS: Adipose, Invitrogen #D6005-01; 

BLADE: Bladder, Invitrogen #06020-01; 

BRALZ: Cerebral cortex from an Alzheimer patient (Brain, cortex, Alzheimer), Invitrogen #D6830-01 ; 

CERVX: Cervix, Invitrogen #06047-01; 
20 COLON: Colon, Invitrogen #D6050-0; 

NESOP: Esophagus, Invitrogen #D6060-01; 

PERIC: Pericardium, Invitrogen #06105-01; 

RECTIVI: Rectum, Invitrogen #06110-01; 

TESOP: Esophageal (Tumor), Invitrogen #06860-01; 
25 TKIDN: Kidney (Tumor), Invitrogen #06870-01 ; 

TLIVE: Liver (Tumor), Invitrogen #06880-01 ; 

TSTOIVI: Stomach (Tumor). Invitrogen #D6920-01; 

BEAST Adult breast, STARATAGENE #735044; 

FEHRT Fetal heart, STARATAGENE #738012; 
30 FEKID: Fetal kidney, STARATAGENE #738014; 

FELNG: Fetal lung, STARATAGENE #738020; 

NOVAR: Adult ovary, STARATAGENE #735260; 

BRASW: subtracted library (BRALZ-BRAWH). 

35 [01 91 ] A cDNA library was constructed from mRNA prepared from tissues of cerebral cortex obtained from an Alzhe- 
imer patient [BRALZ: Cerebral cortex from an Alzheimer patient (Brain, cortex, Alzheimer), invitrogen #06830-01]; the 
cDNAs of this library whose nucleotide sequences were shared by those of mRNAs from whole brain tissue [BRAWH; 
Whole brain, CLONTECH #64020-1] were subtracted by using a Subtract Kit (Invitrogen #K4320-01), 
[0192] Further, mRNAs extracted and purified as poly A(+) RNAs from the human tissues shown below were pur- 

40 chased, A cDNA library was prepared from an RNA mixture in which the poly A(-i-) RNA from each tissue had been 
combined with polyA(-) RNA. The poly A(-) RNA was prepared by removing polyA(-i-) RNA from the total RNA of whole 
brain tissue (CLONTECH #64020-1) by using oligo dT cellulose. The library names and the origins are indicated below 
in the order of "Library name; Origin". 

45 <Purchase of mRNAs of human tissues as poly A(+) RNAs> 

BRAMY: Brain (amygdala), CLONTECH #6574-1 ; 
BRCAN: Brain (caudate nucleus), CLONTECH #6575-1; 
BRCOC: Brain (corpus callosum), CLONTECH #6577-1 ; 
50 BRHIP: Brain (hippocampus), CLONTECH #6578-1 ; 

BRSSN: Brain (substantia nigra), CLONTECH #6580-1; 
BRSTN: Brain (subthalamic nucleus), CLONTECH #6581-1 ; 
BRTHA: Brain (thalamus), CLONTECH #6582-1. 

S5 (2) Preparation of cDNA library 

[0193] cDNA library was prepared from each RNA by the improved method (WO 01/04286) of oligo capping [M. 
MaruyamaandS. Sugano, Gene, 138: 171-174(1994)]. A series of procedures, BAP (Bacterial Alkaline Phosphatase) 
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treatment, TAP (Tobacco Acid Pyrophosphatase) treatment, RNA ligation, first strand cDNA synthesis and RNA re- 
moval, were carried out using the oligo-cap linker (SEQ ID NO: 3279) and oligo dT primer (SEQ ID NO: 3280), as 
described in WO 01/04286. Then, the single-stranded cDNA was converted to a double-stranded cDNA by PGR 
(polymerase chain reaction) using 5' (SEQ ID NO: 3281) and 3' (SEQ ID NO: 3282) PGR primers, and then digested 
5 with Sf;l. Then, a fraction of cDNA fragments, typically 2-kb or longer (3-kb or longer in some cases) , was unidirec- 
tionally cloned into a Oral 1 1 -digested pMEl 8SFL3 vector (Figure 1 ) (GenBank AB009864, Expression vector); the cDNA 
library was thus prepared. 

[0194] The names of cDNA libraries, which were used in the analysis of full-length cDNA sequences, and their origins 
are shown in Table 2. 



Table 2 





Library 


Type 


Origin, etc. 






Culture cell 


NB69 cells (RGB #RGB0480) 




ADRGL 


Tissue 


Adrenal gland (GLONTEGH #64016-1) 




ASTRO 


Primary culture cell 


Normal Human Astrocyte NHA5732 (Takara Shuzo #GG2565) 




BGGI1 


Culture cell 


Gil cells (RGB #RGB0763) 


20 


BNGH4 


Culture cell 


H4 cells (ATGG #HTB-1 48) 






Tissue 


Brain, cerebellum (GLONTEGH #64035-1) 




BRAWH 


Tissue 


Brain, whole (GLONTEGH #64020-1) 






Primary culture cell 


CD34+ cells (AllCells, LLC #CB14435M) 


25 


CTONG 


Tissue 


Tongue, Cancer 




D30ST 


Primary culture cell 


CD34+ cells (ODF induction for 3 days) 


30 




Primary culture cell 


Normal Human Dermal Fibroblasts (Neonatal Skin); NHDF-Neo NHDF2564 
(Takara Shuzo #GG2509) 


FCBBF 


Tissue 


Brain, Fetal 




FEBRA 


Tissue 


Brain, Fetai (CLONTECH #64019-1) 




HCASM 


Primary culture cell 


Human coronary artery smooth muscle cells HCASMC (Toyobo #T305K-05) 


35 


HEART 


Tissue 


Heart (CLONTECH #64025-1) 




HHDPC 


Primary culture cell 


Human demnal papilla cells HDPC (Toyobo #THPCK-001) 




HLUNG 


Tissue 


Lung (GLONTEGH #64023-1) 




HSYRA 


Primary culture cell 


Human synoviocytes from rheumatioid arthritis HS-RA(Toyobo #T404K-05) 




IMR32 


Culture cell 


IMR32 cells (ATGG #GGL-127) 




KIDNE 


Tissue 


Kidney (GLONTEGH #64030-1) 




LIVER 


Tissue 


Liver (CLONTECH #64022-1) 




MAMGL 


Tissue 


Mammary Gland (CLONTECH #64037-1) 




MESAN 


Primary culture cell 


Normal human mesarigial cells NHIV1C56046-2 (Takara Shuzo #CC2559) 




NB9N4 


Culture cell 


NB9 cells (RGB #RCB0477) 




NESOP 


Tissue 


Esophagus (Invitrogen #D6060-01) 




NHNPC 


Primary cuiture ceii 


Normal iiuman neural progenitor cells NHNP5958 (Takara Shuzo #CC2599) 




NT2NE 


Cuiturc ccii 


NT2 cells concentrated after differentiation (NT2 Neuron) 




NT2RI 


Culture cell 


NT2 cells treated by growth inhibitor for 2 weeks after RA induction for 5 weeks 


55 


NT2RP 


Culture cell 


NT2 cells treated by RA for 5 weeks 




NTONG 


Tissue 


Tongue 
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Table 2 (continued) 



Library 


Type 


Origin, etc. 


OCBBF 


Tissue 


Brain, Fetai 


PANCR 


Tissue 


Pancreas (GLONTECH #64031-1) 


PEBLM 


Primary cuiture ceii 


iHuman peripheral blood mononuclear celis HPBMC5939 (Tai<ara Shiuzo 
#CC2702) 






Placenta 




Tissue 


Prostate (GLONTECH #64038-1) 




Primary cuiture ceii 


Human pulmonary artery endothelial cells (Toyobo #T302K-05) 




Tissue 


Salivary Giand (GLONTECH #64026-1) 


SKiVlUS 




Siceietai iViuscie (GLONTECH #64033-1) 


SKNi\^C 


Cuiture ceii 


SK-N-MG ceils (ATGG #HTB-10) 


SKNSiH 


Cuiture ceii 


SK-N-SH celis (RGB #RCB0426) 


SMiNT 




Small Intestine (GLONTECH #64039-1) 




Tissue 


Spleen (GLONTECH #64034-1) 


STOiVlA 


Tissue 


Stomach (GLONTECH #64090-1) 


SYNOV 






TESTI 


Tissue 


Testis (GLONTECH #64027-1) 


THYIVIU 


Tissue 


Thymus (GLONTECH #64028-1) 


TRACH 


Tissue 


Trachea (GLONTECH #64091-1) 


UMVEN 


Primary cuiture ceii 


Human umblilcai vein endothelial cells HUVEC (Toyobo #T200K-05) 


UTERU 


Tissue 


Uterus (GLONTECH #64021-1) 



[01 95] The cDNA library with the high fullness ratio (the fullness ratio of 5'-end, which was calculated for each cDNA 
library by using the protein coding region found in known mRNA species as an index, was 90% in average) prepared 
by the improved oilgo-capping method was constructed by using a eukaryotic expression vector pME18SFL3. The 
vector contains SR a promoter and SV40 small t Intron In the upstream of the cloning site, and SV40 polyA added 
signal sequence site in the downstream. As the cloning site of piVIEI 8SFL3 has asymmetrical Dralll sites, and the ends 
of cDNA fragments contain Sfil sites complementary to the Dralll sites, the cloned cDNA fragments can be inserted 
into the downstream of the SR a promoter unidlrectionally. Therefore, clones containing full-length cDNA can be ex- 
pressed transiently by introducing the obtained plasmid directly Into COS cells, etc. Thus, the clones can be analyzed 
very easily in ternis of the proteins that are the gene products of the clones, or In terms of the biological activities of 
the proteins. 



(3) Assessment of the 5'-end completeness of clones derived from the cDNA library prepared by oligo-capping 

[01 96] With respect to the plasmid DNAs of clones derived from the libraries, the nucleotide sequences of cDNA 5'- 
ends (3' -ends as well in some cases) were determined in a DNA sequencer (ABI PRISM 3700, PE Biosystems), after 
sequencing reaction was conducted by using a DNA sequencing reagent (BIgDye Terminator Cycle Sequencing FS 
Ready Reaction Kit, PE Biosystems) according to the manual. A database was constructed based on the obtained data. 
[0197] The 5'-end completeness of about 770,000 clones derived from the human cDNA libraries prepared by the 
Improved ollgo-capping method was determined by the following method. The clones whose 5'-end sequences were 
consistentwith those of known human mRNA in the public database were judged to be "full-length" if they had a longer 
5'-end sequence than that of the known human mRNA; or even though the 5'-end sequence was shorter, If It contained 
the translation initiation codon it was judged to have the "full-length" sequence. Clones which did not contain the trans- 
lation initiation codon were judged to be "not-full-length". The fullness ratio ((the number of full-length clones)/(the 
number of full-length and not-full-length clones)) at the 5'-end of the cDNA clones was determined by comparing with 
known human mRNA. As a result, the fullness ratio of the 5'-ends was 90%. The result indicates that the fullness ratio 
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at the 5'-end sequence was extremely high in the human cDNA clones obtained by the oligo-capping method. 
EXAMPLE 2 

5 Sequencing analysis of cDNA ends and selection of full-length clones 

[0198] With respect to the plasmid DNAs of clones obtained from eachcDNA library, the 5'-end nucleotide sequences 
of the cDNAs were determined in a DNA sequencer (ABI PRISM 3700, PE Biosystems), after sequencing reaction 
was conducted by using a DNA sequencing reagent (Dye Terminator Cycle Sequencing FS Ready Reaction Kit, dRhod- 

10 amine Terminator Cycle Sequencing FS Ready Reaction Kit or BIgDye Terminator Cycle Sequencing FS Ready Re- 
action Kit, PE Biosystems) according to the manual. A database was constructed using the data obtained. 
[0199] For the analyzed 5'-end sequences of cDNA clones, the data with the annotation of "complete cds" In the 
GenBank and UnlGene were searched by BLAST homology search. When Identical to certain human mRNA sequenc- 
es, such oDNA clones were excluded. Then, clustering was carried out. When the identity was 90% or higher, and the 

'5 length of consensus sequence was 50 base pairs or longer, the cDNA ciones were assumed to belong to an identicai 
cluster, and thus clustered. cDNA ciones longer in the 5' direction were selected from the members belonging to a 
cluster; if required, the 3'-end sequences of the selected clones were determined by the same analysis method as 
used to determine the 5'-end sequences. The data of the end sequences obtained were analyzed, and then the clones 
forming a sequence contig at 5'- and 3'-ends were excluded. Further, as mentioned above, the data was analyzed 

20 again by BLAST homology search; when identical to certain human mRNA sequences (Including sequences patented 
and applied for), the cDNA clones were excluded. Thus, the cDNAs clones to be analyzed fortheir nucleotide sequence 
were obtained. 

EXAMPLES 

25 

Analysis of the full-length nucleotide sequences 

[0200] The full-length nucleotide sequences of the selected clones were determined. The nucleotide sequence de- 
termination was mainly performed by primer walking method comprising the dideoxy terminator method using custom- 
30 made synthetic DNA primers. Namely, the nucleotide sequences of the DNAs were determined In a sequencer from 
PE Biosystems, after sequencing reaction was carried out with a DNA sequencing reagent from the same supplier 
using the custom-made synthetic DNA primers according to the manual. A part of the clones were analyzed with a 
DNA sequencer from Licor. 

[0201] Further, the nucleotide sequences of a part of the clones were determined by the shotgun method where the 
35 plasmlds containing the cDNAs were digested at random were used, instead of the use of custom-made primers, by 
the same method in the DNA sequencer. The full-length nucleotide sequences were finally detemined by completely 
assembling the partial nucleotide sequences obtained by the above method. 

[0202] Then, the regions translatable to proteins were deduced from the detennined full-length nucleotide sequences, 
and thereby the amino acid sequences were determined. SEQ ID NOs corresponding to the respective sequences are 

40 shown in Table 1 . 

EXAMPLE 4 

Functional prediction by homology search 

45 

[0203] For the detennined nucleotide sequences, GenBank, SwissProt, UniGene, and nr were searched by BLAST. 
The clones exhibiting higher homology, which were convenient to predict their functions based on the nucleotide se- 
quences and deduced amino acid sequences, were selected based on the BLAST search hit data whose P value or 
E value was 10"'* or lower and for which the length of consensus sequence x homology = 30 or higher in the amino 
50 acid database search. Further, from them, representative clones were selected, which are shown as Homology Search 
Result Data in the last part herein. Accordingly, the data shown herein are merely the representative data, and the 
moiecuie exhibiting homology to each clone is not limited thereto. Further, with respect to a part of clones, the BLAST 
search hit data that did not meet the criteria as described above are not shown herein. 
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EXAMPLES 

Search for signal sequence, transmembrane domain and other functional domains in the deduced amino acid 
sequences 

5 

[0204] With respect to the amino acid sequences deduced from the full-length nucleotide sequences, the prediction 
was made for the presence of signal sequence at the amino terminus, the presence of transmembrane domain, and 
the presence of functional protein domains (motifs). The signal sequence at the amino terminus was searched for by 
PSORT[K. Nakal & M. Kanehlsa, Genomics, 14: 897-911 (1992)]; the transmembrane domain, by SOSUI [T. HIrokawa 

10 et ai., Bioinformatlcs, 14: 378-379 (1998)] (Mitsui Knowledge Industry); the function domain, by Pfam (http://www. 
sanger.ac.ul</Software/Pfam/index.shtml). The amino acid sequence in which the signal sequence at the amino termi- 
nus or transmembrane domain had been predicted to be present by PSOFIT or SOSUI were assumed to be a secretory 
or membrane protein. Further, when the amino acid sequence hit a certain functional domain by the Pfam functional 
domain search, the protein function can be predicted based on the hit data, for example, by referring to the function 

15 categories on the PROSITE (http://www.expasy.ch/cgl-bln/prosite-list.pl). In addition, the functional domain search can 
also be carried out on the PROSITE. 

[0205] The search results obtained with the respective programs are shown below. 

[0206] The clones whose deduced amino acid sequences were detected to have the signal sequences by PSORT 
are as follows. 

20 ADRGL1 0001 600, BGGI120010970, BNGH410001180, BNGH410001370, BRACE1 0001 690, BRACE2001 0650, 
BRACE20014920, BRACE20079530, BRACE20086550, BRACE200B9600, BRAWH20004430, BRAWH20040950, 
BRAWH20052250, BRAWH20092610, GD34C2000051 0, CTONG20028160, FEBRA20003780, FEBRA20004150, 
FEBRA20006900, FEBRA20008090, FEBRA20012270, FEBRA20015840, FEBRA20020860, FEBRA20021910, 
FEBRA20037070, HHDPG20000950, HLUNG1 0000240, HLUNG20001250, HSYRA20003470, HSYRA20014200, 

25 IMR3210001580, IMR3220007750, IMR3220008590, KIDNE1 0001 430, KIDNE20001670, KIDNE20003300, 
KIDNE20042620, KIDNE20054000, KIDNE20060530, KIDNE20066520, LIVER1 0005420, MAMGL1 0000320, 
NHNPC20002060, NT2NE1 0001 630, NT2NE200 16260, NT2NE20055170, NT2RI20009740, NT2RI2001 5400, 
NT2RI20030110, NT2RI20042840, NT2RI20053350, NT2Ri20070B40, NT2RI20073030, NT2RI20074980, 
NT2RI2007a270. NT2Ri20092890, NT2RP7001591 0, NT2RP70021 51 0, NT2RP70029820, NT2RP70047900, 

30 NT2RP70074220, NT2RP70079250, NT2RP70091680, NT2RP70094290, NT2RP70094980, NT2RP70095070, 
NT0NG1 0000980, NT0NG1 00021 40, NT0NG1 0002570, 0CBBF1 0000420, PANCR1 0000210, PLACE60020B40, 
PLACE60026990, PLACE60043960, PLACE60049930, PLACE60050290, PR0ST1 0005260, PR0ST1 0005360, 
PROST20000360, PROST20029600, PROST20044160, PROST20054260, PROST20058800, SIV1INT1 00001 60, 
SPLEN1 0000910, SPLEN20001340, STOMA20002570, TESTI20026320, TESTI20026980, TESTI20027070, 

35 TESTI20028660, TESTI20042870, TESTI20049940, THYMU1 0000830, UTERU 10001 920, UTERU20003930, 
UTERU20004850 

[0207] The clones whose deduced amino acid sequences were detected to have the transmembrane domains by 
SOSUi are as follows. Numerals indicate the numbers of transmembrane domains detected in the deduced amino acid 
sequences. Of the search result, the clone name and the number of transmembrane domains are demarcated by a 

40 double slash mark (//). 

3NB691 00001 80//4, 3NB691 0000850//1 , 3NB6920000290//2, 3NB6920003300//5, 3NB6920005450//2, 
ADRGL1 00001 80//1, ADRGL1 0001 600//1 , ADRGL20003230//2, BGGI120010970//1 , BNGH41 0000800//2, 
BNGH410001040//2, BNGH41 0001370//1 , BNGH41 0001 980//11 , BFiACE200071 80//1 , BRACE2001 0650//1 , 
BRAGE20011170//2, BRACE20013400//2, BRAGE20013520//2, BRAGE20014230//2, BRACE20014530//1 , 

45 BRAGE20014920//1, BRACE2001 8590//1 , BRACE20022270//1 , BRAGE20026850//1 , BRACE20030780//3, 
BRAGE20031100//10, BRAGE20034490//2, BRAGE20071380//3, BRACE20071 970//1 , BRAGE20072810//2, 
BRAGE20075020//I, BRACE20075380//3, BRACE20076410//1 , BRACE20076850//1 , BRACE20077610//2, 
BRAGE20077640//2, BRAGE20077980//1 , BRACE20086550//1 , BRAGE20089600//1 , BRAGE20091880//1 , 
BRAWH1 000001 0//1, BRAWH10000370//1 , BRAWH10000940//1, BRAWH10001620//1 , BRAWH10001800//1 , 

50 BRAWH20004430//8, BRAWH20006970//1 , BRAWH20011290//4, BRAWH20014380//2, BRAWH2001 5030//2, 
BRAWH20036930//1 , BRAWH20038320//2, BRAWH20059980//1 , BRAWH20087060//1 , BRAWH2009261 0//3, 
CD34C20000510//1, CTONG2001 5330//1 , CTONG200281 60//2, CTONG20037820//1 , CTONG200471 60//4, 
FGBBF1 00061 80//3, FCBBF1 0006750//2, FGBBF2000591 0//1 . FCBBF20009400//3, FCBBF2001 5380//5, 
FEBRA20004040//2, FEBRA20004150//3, FEBRA20004520//3, FEBRA2000491 0//2, FEBRA20006560//3, 

55 FEBRA20008800//1, FEBRA2001 0930//7, FEBRA2001 2450//3, FEBRA20012940//1 , FEBRA2001 351 0//2, 
FEBRA20014870//1, FEBRA20015840//2, FEBRA20020860//2, FEBRA20021 91 0//1 , FEBRA20031550//2, 
FEBRA20041910//1, FEBRA20063150//1 , FEBRA20066670//2, HGASM1 000061 0//2, HGASM20002020//1 , 
HEART20000990//1, HEART20004920//2, HHDPC20000950//2, HLUNG1 0000370//2, HLUNG20001160//1 , 
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HLUNG20001420//12, HLUNG20001760//2, HSYRA20003470//1 , HSYRA20008280//1 , HSYRA20011030//1 , 
HSYRA20015800//2, IMR3210000440//1 , IMR3210001580//2, ll\/IR321 0002660//6, IMR3220008590//1 , 
IMR3220009840//2, KIDNE10001040//1 , KIDNE1 0001430//1 , KIDNE20000700//1 , KIDNE20000850//1 , 
KIDNE20001670//7, KIDNE20003150//1 , KIDNE20003300//7, KIDNE20003490//4, KIDNE20004220//1 , 

5 KIDNE20005170//7, KIDNE20033050//2, KIDNE20033570//1 , KIDNE2003941 0//5, KIDNE200441 1 0//3, 
KIDNE20048280//12, KIDNE2004981 0//2, KIDNE20054770//12, KIDNE20060530//2, KIDNE20060620//2, 
KIDNE20063530//1, KIDNE20066520//2, KIDNE20067600//1 , KIDNE20071 860//1 , KIDNE20074220//1 , 
KIDNE20075690//5, LIVER1 0000580//3, LIVER1 0000670//1 ^ LIVER1 0001 040//2, LIVER1 00011 1 0//1 , 
LIVER10001750//1, LIVER200041 60//1 . MAMGL1 0001 7B0//1 , MAMGL1 0001 840//2, MESAN10001470//1 , 

10 MESAN10001800//7, MESAN20001490//2, NB9N420000420//1 , NHNPC20002060//2, NT2NE1 0000230//1 , 

NT2NE10001850//6, NT2NE20003920//1 , NT2NE20004550//1 , NT2NE20004700//1 , NT2NE20005500//1 , 
NT2NE20012470//2, NT2NE20014350//1 , NT2NE2001 6260//4, NT2NE20034080//2, NT2NE20047160//1 , 
NT2NE20055170//3, NT2NE20057200//1 , NT2RI20005970//7, NT2RI20014490//11 , NT2RI2001 6570//2, 
NT2RI20018460//1, NT2RI2001 8660//2, NT2RI20021520//7, NT2RI20022430//4, NT2RI20022520//3, 

15 NT2RI20030110//1, NT2RI2003051 0//2, NT2RI20033830//2, NT2RI200367B0//1 , NT2RI20044420//1 , 
NT2RI20049850//2, NT2RI20050870//8, NT2RI20051500//1 , NT2RI20066820//1 , NT2RI20068250//11 , 
NT2RI2007G4aO//1 , NT2RI20077540//4, NT2RI2Q078790//1 , NT2RI20081 880//3, NT2RI20085980//3, 
NT2RI20092890//2, NT2RI20094060//4, NT2RP60000320//10, NT2RP60000390//1 , NT2RP60001 090//1 , 
NT2RP70002380//4, NT2RP7Q0Q2590//5, NT2RP70003640//1 , NT2RP70004770//4, NT2RP70006240//1 , 

20 NT2RP70011660//11, NT2RP70015910//2, NT2RP7002151 0//1 , NT2RP70023790//2, NT2RP700261 90//1 , 
NT2RP70043730//3, NT2RP70047900//2, NT2RP70049250//1 , NT2RP70064080//3, NT2RP70071540//2, 
NT2RP70071770//13, NT2RP70072520//2, NT2RP7007381 0//3, NT2RP70075040//4, NT2RP70076170//2, 
NT2RP70079750//2, NT2RP70081330//2, NT2RP70081370//8, NT2RP70085500//2, NT2RP70090120//10, 
NT2RP70091490//3, NT2RP70093220//1 1 , NT2RP70093730//1 , NT2RP70094290//1 , NT2RP70094810//12, 

25 NT2RP700949B0//3, NT0NG1 0002570//2, NTONG20002650//4, NTONG20004920//1 , NTONG20008000//1 , 
NTONG20012220//1, OCBBF20002310//2, OCBBF20009980//1 , OCBBF200121 00//2, PLACE50000670//1 , 
PLACE50001050//1, PLACE60005550//2, PLACE60012810//2, PLACE6001 8860//7, PLACE600201 60//1 , 
PLACE60020840//6, PLACE60037a50//1 , PLACE60037450//1 , PLACE60047380//1 , PLACE60049930//1 , 
PLACE60050290//1 , PR0ST1 0002200//2, PRGST10002720//1 , PR0ST1 0005360//1 , PROST20000360//1 , 

30 PROST20001 760//4, PROST20029600//2, PROST20033020//1 , PROST20039220//3, PROST200441 60//1 , 

PROST20051430//1, PROST20054260//5, PROST200591 90//3, PROST20059430//3, PROST20069880//1 , 
PROST20072370//1, PUAEN1 0000570//1 , PUAEN1000161 0//1 , PUAEN10003220//1 , SKMUS20007740//1 , 
SKNMG1 00001 90//1, SKNMC10000290//1, SKNMC10002290//2, SKNMG1 000251 0//8, SMINT10000160//2, 
SMINT10000420//8, SMINT10000570//2, SMINT1 00011 80//1 , SMINT20000180//2, SMINT20002770//3, 

35 SPLEN20001340//1, SPLEN20002430//1 , SPLEN20002700//1 , SPLEN20003100//1 , SPLEN20004960//2, 
STOMA10000520//2, ST0MA1 00011 70//1 , STOMA20000320//1 , STOMA20002570//3, SYNOV20001770//2, 
TESTI10000420//1, TESTI10000960//1, TESTI20006000//1 , TESTI20009090//1 , TESTI20009700//7, 
TESTI2001 1 340//5, TESTI20012370//1 , TESTI20013520//4, TESTI2001 4200//9, TESTI2001 621 0//2, 
TESTI20016710//r TESTI2001 8620//2, TESTI20020020//2, TESTI2002081 0//8, TESTI2002251 0//3, 

40 TESTI20024670//2^ TESTI20025800//2, TESTI20026980//2, TESTI20027000//1 , TESTI20030370//1 , 
TESTI20031930//r TESTI20042870//3, TESTI20047120//5, TESTI20049940//2, TESTI20057420//1 , 
TESTI20058600//6, TESTI20067740//2, TESTI20069780//3, TESTI20074800//5, TESTI20077490//4, 
TESTI20079510//3; TESTI20080200//7, TESTI20081 440//1 , TESTI20087740//2, TESTI20088470//2, 
TESTI20136910//1, THYMU1 0001 760//1 , THYMU10003290//1 , THYMU10003820//4, THYMU10005580//4, 

45 TRAGH1 0000630//3, TRACH10001000//1 , TRACH10001400//1 , TRACH20001850//2, TRACH20001960//2, 
TRAGH20004960//2, TRAGH20006650//1 1 , TRAGH20007670//2, TRAGH20008980//2, TRAGH20015920//2, 
UMVEN20001330//2, UTERU10000770//2 

[0208] The Names of clones whose deduced amino acid sequences were detected to have functional domains with 

Pfam, and the name of hit functional domains are as follows. The search result is indicated as "clone name/Zfunctiona! 
50 domain name". When the clone has multiple hit functional domains, they are listed and demarcated by a double slash 

mark (//). When the clone has multiple hits of an identical functional domain, each is listed without abridgment. 

3NB691 00001 80//TS-N domain//UBA domain 

3NB691 0001 1 60//START domain 

3NB6910001290//KRAB box 
55 3NB691 0001 730//RI01/ZK632.3/MJ0444 family 

3NB6920002810//DEAD/DEAH box helicase/ZHelicases conserved C-temiinal domain 3NB6920009120//Zinc finger 

C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Frataxin-lil<e domain 

3NB6920010020//Regulator of G protein signaling domain 
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3NB6920014330//Domain of unknown function 

3NB692001471 0//DNA binding domain witli preference for A/T ricli regions//Zinc finger, C2H2 type 
3NB6920015110//RNA recognition motif. (a.l<.a. RRiVI, RBD, or RNP domain)//RNA recognition motif. (a.l<.a. RRiVI, 

RBD, or RNP domain) 

5 3NB6920015570//KRAB box//Zinc finger, C2H2type//Zinc finger, C2H2type//Zincfinger,C2H2type//Zincfinger,C2H2 
type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 

type//Zinc finger, C2H2 type/ZZinc finger, C2H2 type 
ADRGL10000020//BTB/POZ domain/ZKelch motif/ZKelcli motif 

ADRGL10000650//Zinc finger, C2H2type//Zinc finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
10 finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

ADRGL1 0001 600//Cytochrome P450//Cytoclirome P450 ADRGL1 0001 650//Urease//Cliioroliydrolase//Diliydroorota- 
se-lii<e 

ADRGL20000740//Doclcerin domain type i//RhoGAP domain 

ASTR01 00001 80//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
15 G-beta repeat 

ASTR020000950//SNAP-25 family 
ASTR0200041 70//Ribonuclease T2 family 

BGGI120005330//iMP dehydrogenase / GMP reductase N terminus//CBS domain//CBS domain/ZDiliydroorotate de- 
hydrogenase/yHistidine biosynthesis protein//FMN-dependent dehydrogenase/ZConserved region in giutamate syn- 
20 thase/ZIMP dehydrogenase / GMP reductase C terminus 
BGGI120005440//lmportin beta binding domain 
BGGI120006840//Sir2 family 

BGGi120006930//Goliagen triple helix repeat (20 copies)//SAI\/l domain (Sterile alpha motif) 
BGGi120010970//F5/8 type C domain/ZLaminin G domain/ZLaminin G 
25 domain/ZFibrinogen beta and gamma chains, C-terminai globular domain 

BGGi 12001 71 40//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2type//Zi no finger, C2H2type//Zincfinger, C2H2 
type//Zlnc finger, C2H2 type/ZTranscription factor S-ll (TFIiS)//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
BNGH410000290//SPRY domain 

BNGH4i 0000340//Prolyl oilgopeptidase famiiy/ZPhosphoiipase/Carboxytesterase 

30 BNGH41 0001 040//Eul<aryotlc protein i<inase domain 

BNGH410001180//Low-denslty lipoprotein receptor domain class A//Low-density lipoprotein receptor domain class A// 
Low-density lipoprotein receptor domain class A//WAP-type (Whey Acidic Protein) 'four-disuiflde coreV/Low-density 
lipoprotein receptor domain class A//Low-density lipoprotein receptor domain class A//Low-density lipoprotein receptor 
domain class A//Low-denslty lipoprotein receptordomain class A//iong chain scorpion toxins/ZChitin binding Perltrophin- 

35 A domain//Low-density lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-den- 
sity lipoprotein receptor repeat class B//Low-density lipoprotein receptor repeat class B//Low-density lipoprotein re- 
ceptor repeat class B 

BNGH41 0001 370//Filamin/ABP280 repeat. 

BNGH41 0001 770//iMP dehydrogenase / GMP reductase N terminus//CBS domain//CBS domain/ZDihydroorotate de- 
40 hydrogenase/ZHistidine biosynthesis protein//FMN-dependent dehydrogenase/ZConserved region in giutamate syn- 
thase//IMP dehydrogenase / GMP reductase C terminus 
BNGH410001900//Viral (Superfamily 1) RNAhelicase 

BNGH41 0001 980//POT family//Bacteriorhodopsin//Sugar (and other) transporter 

BNGH420005320//SCAN domain//KRAB box//Zinc finger, C2H2 type//GATA zinc finger//Zinc finger, C2H2 type/yZinc 
45 finger, G2H2 type//Zinc finger, G2H2 type//Zinc finger, G2H2 type/manscription factor S-li (TFIIS)//Zinc finger, G2H2 
type 

BRACE 10000420//Fatty acid desaturase/ZProtein phosphatase 2C 

BRACE 10000930//Zinc finger, C3HC4 type (RING finger)/n"RAF-type zinc finger//TRAF-type zinc finger/ZMATH do- 
main 

50 BRACE1 0001 150ZZDNA gyraseZtopoisomerase IV, subunit A/ZNucieosome assembly protein (NAP) 

BRACE1 0001 660ZZZinc finger, G2H2 type 
BRACE20002800ZZiQ calmodulin-binding motif 
BRACE20005650ZZATP synthase ab C terminal 

BRACE20006980ZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeat 
55 BRAGE200071 80ZZCalcitonin Z CGRP Z lAPP family 

BRACE20008850ZZZinc finger, C3HC4 type (RING finger) 
BRACE20010650ZZF-box domain. 

BRACE20013750ZZHepatitis C virus non-structural protein NS4a 
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BRACE2001 4920//Protein-tyrosine phosphatase 

BRACE20018550//Ank repeatZ/Ank repeat//Ank repeatZ/Ank repeat//Ank repeat 
BRACE2001 8590//Transmembrane 4 family 
BRACE20019440//Protein of unknown function DUF82 
5 BRACE2002091 0//Zinc finger, C3HC4 type (RiNG finger)//Zinc finger, C3HC4 type (RING finger)//E7 protein. Early 

protein//B-box zinc finger 

BRACE20022020//Eukaryotic protein kinase domain 
BRACE20024090//Homeobox domain 

BRACE20024680//Simiiarity to lectin domain of ricin beta-chain, 3 copies. 
10 BRACE20026850//short clnain dehydrogenase 

BRACE20027720//Metallo-beta-lactamasesuperfamily 

BRACE20027920//FGGY family of carbohydrate kinases 

BRACE20028120//Ras family//ADP-ribosylation factor family 

BRACE20031100//Domain of unknown function DUF20//Patched family 
15 BRACE20071 740//KRAB box//Zinc finger, C2H2 type/zTranscription factor S-li (TFiiS)//Zinc finger, C2H2 type//-Zinc 

finger C2H2type//Zlnc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc 

finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type/ZZinc finger, C2H2 type//Zinc finger, C2H2type//Zinc 

finger, C2H2 type BRACE2007401 0//EF hand//EF lnand//Mitoclnondriai carrier proteins//l\/litochondriai carrier proteins// 

iVIitochondriai carrier proteins 
20 BRACE20074470//Cadherin domain//Cadherin domainZ/Giutathione peroxidasesZZCadherin domain 

BRACE20076410/ZSushi domain (SCR repeat)//Sushi domain (SCR repeat)//Sushi domain (SCR repeat) 

BRAGE20076630//PH domain 

BRACE200B0970//Phosphofructokinase 

BRACE200B3800//Fibronectin type III domain 
25 BRACE200B3850//bZIP transcription factorZ/Homeobox associated leucine zipper BRACE200B4430//ThioredoxinZ/ 

Thioredoxin 

BRACE20092120ZZ3'-5' exonucleaseZZAdenyiylsulfate kinaseZ/Protein of unknown function DUF82 

BRACE20093610ZZBacteriai type il secretion system protein 

BRAWH10000940Z/Rieske [2Fe-2S] domainZZPhosphoglucose isomeraseZZFAD binding domainZZPyridine nucleotide- 
30 disuiphide oxidoreductaseZZPhytoene deliydrogenase related enzyme 

BRAWH10001300ZZPH domain/ZRhoGAP domainZZTropomyosins 

BRAWH1 0001 620ZZaiphaZbeta hydrolase fold 

BRAWH10001640Z/KRAB boxZZENV polyprotein (coat polyprotein) 

BRAWH10001680Z/Homeobox domain 
35 BRAWH20000480Z/TransposaseZ/Kinesin motor domain 

BRAWH20001 770ZZSerine hydroxymethyltransferase 

BRAWH20003230Z/Wiskott Aldrich syndrome homology region 2 

BRAWH20004430ZZLectin (probable mannose binding)/ZSurfactant associated polypeptide 
BRAWH20004760ZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
40 BRAWH20006330ZZZinc finger, C2H2 typeZZZinc finger. C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZ 
Zinc finger, C2H2 typeZZZinc finger C2H2 type/ZZinc finger C2H2 typeZZZinc finger, C2H2 type 
BRAWH2000651 OZ/HiVIGL-iike 

BRAWH20006860ZZEukaryotic protein kinase domainZ/Proteln kinase C terminal domain 
BRAWH20009840Z/Cytochrome P450 
45 BRAWH2001 1 660//Glycosyl hydrolases family 35 

BRAWH2001 2030//Phorbol esters/diacyigiycerol binding domain (C1 domain)//Zinc finger, C3HC4 type (RiNG finger) 
//PHD-flnger 

BRAWH20014180//Adenosine-deaminase (editase) domain 
BRAWH20014610//TS-N domain//UBA domain 
50 BRAWH20014840//Glycosyl transferases//Similarity to lectin domain of ricin beta-chain, 3 copies. 
BRAWH20036890//Protein phosphatase 20 

BRAWH20059980Z/CUB domain//Low-density lipoprotein receptor domain class 
/V/CUB domainZZLow-density lipoprotein receptor domain class ZVZFz domain 
BRAWH20060440//PPR repeat 
55 BRAWH20064500//Nuciear transition protein 2ZZHiVlG (high mobility group) box 
BRAWH20076050Z/Keratin, high sulfur B2 protein 

BRAWH20089560ZZPoly-adenylate binding protein, unique domain. //Magnesium ohelatase, subunit Chll//Uncharac- 
terized protein family UPF0034//KE2 family protein//Fonnin Homology 2 Domain 
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BRAWH20093600//Family 4 glycosyl hydrolase 

CD34C2000051 0//Glycosyl hydrolases family 18//Glycosyl hydrolases family 18//Chitin binding Peritrophin-A domain 
CTONG20005890//DNA gyrase/topoisomerase IV, subunit A//PDZ domain (Also known as DHR or GLGF).//PDZ do- 
main (Also known as DHR or GLGF). 

CTONG20011390//Prokaryotic dksA/traR C4-type zinc finger/ZHepatitis C virus non-structural protein NS2 
CTONG20013200//Uncharacterized protein family UPF0020 
CTONG20018200//PHD-finger//PHD-finger//PWWP domain//SET domain 

CTONG20019550//Spectrin repeat/ZXylose isomerase/ZSpectrin repeat/ZSpectrin repastZZSpectrin repeatZZFIagellar 
hook-associated protein 2ZZAdlnesin lipoproteinZZSpectrin repeatZZSpectrin repeatZZProtein of unknown function 
DUF118ZZSpectrin repeatZZBacterial flagellin N-tennlnusZZSpectrin repeatZZSpectrin repeatZZSpectrin repeatZZCaulimov- 
irus movement proteinZZSpectrin repeat 

CTONG20025580ZZZinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZ/PHD-flngerZ/ZInc finger 
C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
CTONG20028030ZZDomain of unknown function DUF19ZZRibosomal protein S18 

CTONG20028160ZZCadhenn domainZZCadherin domainZZCadherin domainZZCadherin domainZZCadherin cytoplasmic 
region 

CTONG20028200ZZPapain family cysteine proteaseZZE2 (early) protein, N terminalZZT-box 
CTONG20037820ZZNeurotransmitter-gated ion-channelZZNeurotransmitter-gated ion-channei 
GTONG20047160ZZPCI domainZZLatrophilinZCL-1 -iike GPS domain 

GTONG20055530ZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZPyridoxal-dependent de- 
carboxylase conserved domainZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZ 
Ank repeatZZAnk repeat 
CTONG20064490ZZPCI domain 

D30ST20001B40ZZRNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
DFNES20002120ZZQueuinetRNA-rlbosyltransferase 

DFNES20002680ZZProtozoanZcyanobacterial globinZZKE2 family protein/ZAdhesin lipoprotein 

FGBBF1 0005980ZZKRAB boxZZZinc finger, C2H2 typeZZZincflnger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 
typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZAN1-llke Zinc 
fingerZZZinc finger, C2H2 typeZZZincflnger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZZZinc finger, C2H2 type 

FCBBF1 0008870ZZCortlcotropin-releaslng factor family 
FCBBF20000940ZZHomeobox domain 
FCBBF20002320Z/T-box 

FCBBF20002760ZZKelch motlfZZKelch motlfZ/Kelch motif 

FCBBF2000591 OZZAdenylate kinaseZZViral (Superfamily 1 ) RNA helicaseZZTPR DomainZZTPR Domain 
FCBBF20008150ZZLIM domain containing proteinsZZLIM domain containing proteinsZZLIM domain containing proteins 
FCBBF2000951 OZZKRAB boxZZZinc finger, C2H2 typeZZZincflnger, C2H2 typeZZZincflnger, C2H2 typeZZPHD-fingerZZZinc 
finger G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 

FCBBF20012110ZZMyc amino-terminal region 
FCBBF20015380ZZGNS1ZSUR4 family 
FGBBF20016720ZZDomain of unknown function DUF94 
FCBBF40002820ZZEIectron transfer flavoprotein beta subunit 

FCBBF50002610ZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZPHD-fingerZZZinc finger 
G2H2 typeZZZinc finger, C2H2 type/ZTranscription factor S-ll (TFIIS)ZZZinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZ 
Zinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZincflnger, C2H2 typeZZZinc finger, G2H2typeZZ 
Zinc finger, C2H2 type 

FEBRA20000530ZZBTBZPOZ domalnZ/Kelch motlfZZKelch motif/ZKelch motlfZZKelch motifZZKelch motif 
FEBRA20001050ZZTPR DomainZZTPR DomainZZTPR DomainZZTPR DomainZ/PPR repeatZZTPR Domain 
FEBRA20003770ZZAnk repeatZZIronZmanganese superoxide dismutases (SODM)ZZRas association (RalGDSZAF-6) do- 

mainZZFERIVI domain (Band 4.1 family) 

FEBRA20003970ZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZincflnger, G2H2 typeZZZinc 
finger G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTranscription factor S-ll (TFIIS)ZZZinc finger, G2H2 
typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc fingerZZZinc finger, G2H2 type 
FEBRA20003990ZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZincflnger, C2H2 typeZZZincflnger, C2H2 typeZZZinc 
finger G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
FEBRA20004150ZZSTAS domain 

FEBRA20004540ZZZinc finger, G2H2 typeZZZincflnger, G2H2 typeZZBolA-like proteinZZZinc finger, G2H2 typeZZZinc finger, 
G2H2 typeZZZinc finger G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTranscrip- 
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tion factor S-l I (TFIIS)//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type// 
Zinc finger, C2H2 type//Zlnc finger C2H2 type//Zinc finger, C2H2 type 
FEBRA20005360//Cystatin domain 
FEBRA20007330//EF hand//EF hand 

FEBRA20007870//Zinc finger, C2H2type//Zinc finger, C2H2 1ype//Zinc finger, C2H2type//Zinc finger, C2H2 type/ZZinc 
finger C2H2 type 

FEBRA20008560//lmportin beta binding domain//Armadillo/beta-catenin-lil<e repeats 
FEBRA20008810//Actin 

FEBRA20009720//KRABbox//Zincfinger, C2H2 type//Zinc finger C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2type//Bol A-likeprotein//Zinc finger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type// 
Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type FEBRA20011330//Trypsin and protease infiib- 
itors//PCI domain 
FEBRA20011460//SCAN domain 

FEBRA20012450//Leucine ricli repeat N-terminal domain/ZLeucine Rich Repeat/ZLeucine Rich Repeat/ZLeucine Rich 
Repeat/ZLeucine Rich RepeatZ/Leucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeucine rich repeat 
C-terminal domainZZImmunoglobulin domainZ/Flbronectin type III domain 
FEBRA20014920ZZS-adenosylmethionine synthetase 

FEBRA20015840//EGF-like domainZZEGF-lil<e domalnZZEGF-IIke domalnZZEGF-IIke domalnZZEB moduleZZEGF-IIke do- 
mainZZEGF-like domain 
FEBRA20017060ZZImmunoglobulin domain 

FEBRA20017150ZZZinc finger, C3HC4 type (RING finger)ZZZinc finger, C3HC4 type (RING finger)/Zlnsulin-like growth 
factor binding protelnsZZB-box zinc fInger.ZZCONSTANS family zinc fIngerZZB-box zinc fInger.ZZPutatlve zinc finger in N- 
recogninZZFIbronectin type III domainZZSPRY domain 

FEBR/\20019890ZZPH domainZZPutatlve GTP-ase activating protein for ArfZZAnk repeatZZAnk repeat 

FEBRA20024290ZZRNA polymerase alpha subunit 

FEBRA20024420ZZGMCoxldoreductases 

FEBRA20025250Z/TBC domain 

FEBRA20034290ZZCAP-Gly domain 

FEBRA20043250ZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZZAnk repeatZ/Homeobox associated leucine zipper 
FEBRA20043290ZZMyosin tallZZIactateZmalate dehydrogenaseZZTroponinZZDomain present in Hsp70 regulatorsZZInter- 
leukin-6ZG-CSF/l\^GF familyZ/Myosin tail 

FEBRA20044900ZZPOU domain - N-terminal to homeobox domalnZZSpectrin repeatZZSpectrIn repeat 
FEB RA20045920ZZGIycoprotease f ami ly 

FEBRA20050140ZZZInc finger, C2H2typeZZZinc finger, C2H2 lypeZZZInc finger, C2H2typeZZZInc finger, C2H2 typeZZZInc 
finger, C2H2 typeZZZinc finger, C2H2 lypeZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZInc 
finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type FEBRA20050790ZZProtein-tyrosine phosphataseZZ 
Dual specificity phosphatase, catalytic domain 

FEBRA20057260//TBC domain 

FEBRA20057880//PDZ domain (Aiso known as DHR or GLGF). 
FEBRA20060920ZZDIX domain 

FEBRA20062700ZZhaloacid dehalogenase-like hydrolase 

FEBRA20064760ZZZinc finger, C2H2type/ZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZDIVl DNA binding domainZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger C2H2 typeZZ 
Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZGATA zinc fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZPHD-fingerZ/Zinc finger, C2H2 typeZZZInc finger, C2H2 typeZZTranscription factor S-ll (TFIIS)ZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
FEBRA20066670ZZTransthyretin precursor (formerly prealbumin) 

FEBRA20067360ZZKRABboxZZZincfinger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZZZInc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZInc finger, G2H2 

type 

FEBRA20069420ZZKRAB boxZZZinc finger, C2H2 typeZZRIbosomal protein L37eZZZinc finger, C2H2 lypeZZZInc finger 
C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, 

G2H2 typeZZZinc finger, C2H2 type 
FEBRA20070170ZZPX domain 

FEBRA20072000ZZTPR DomainZZTPR DomalnZZTPR DomalnZZTPR DomainZZTPR DomainZZTPR Domain 
FEBRA20075510ZZRas family 
HGASI\/I200021 40ZZGyclin 

HCASM20003070ZZRNA recognition motif, (a.k.a. RRIVI, RBD, or RNP domain) 
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H EART200041 1 0//POT family 
HEART20005060//Occludin/ELL family//K-box region 
HEART20005680//Nerve growtli factor famiiy 

HHDPC20000550//Viral (Superfamiiy 1) RNA heiicase//NB-ARC domainZ/Adenylate klnase/ZAdenyiate l<inase 
5 HHDPC20000950//'Extraceiluiar iinlc domain/ZLectin C-type domain 

HHDPC20001150//Collagen tripie helix repeat (20 copies)//C1q domain 

HHDPC20001490//'UBA domain/Zlntegrase Zinc binding domain//iBR domain//iBR domain 

HHDPC20003150//Zn-finger in Ran binding protein and otliers.//Zinc i<nuci<ie 

HHDPC20004550//FERM domain (Band 4.1 famiiy) 
10 HHDPC20004560//2S seed storage famiiy 

HHDPC20004620//FAD binding domain 

HLUNG10000240//Transforming growth factor beta lil<e domain 

iHLUNG10000370//TPR Domain//TPR Domain//TPR Domain//TPR Domain 

HLUNG10000760//Hi\/IG (high mobility group) box 
15 HLUNG10000990//Adenosylmethionine decarboxylase 

HLUNG20000680//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger// 

Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger C2H2 type//Zincfinger, C2H2 type//Zinc finger, C2H2type// 

Zinc finger, C2H2 type 

HLUNG20001420y/REV protein (anti-repression trans-activator protein)//Bacteriorhodopsin//Photosystem ii reaction 
20 centre T proteinZ/Sugar (and other) transporter/ZFecCD transport famiiy 
HLUNG20001760//Transthyretin precursor (formeriy preaibumin) 
H LU NG20002550/n"rypsin 
HSYRA1 0001 1 90//TBC domain 

HSYRA10001370//KRAB box//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-flnger// 
25 Zinc finger, C2iH2 type/ZZinc finger, C2iH2 type//BoiA-lil<e protein/ZZinc finger, C2iH2 type/ZZinc finger, C2iH2 type/ZZinc 

finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

HSYRA10001680//DEAD/DEAH box heiicase 

HSYRA1 0001 780//Alpha-2-macroglobulin famiiy N-terminai region 

HSYRA2000135G//F-box domain.//Kelch motifZ/Keich motif/ZKeich motif 
30 HSYRA20005100//UvrD/REP heiicase 

HSYRA20013320//lnsulin-lil<e growth factor binding proteins/ZThyrogiobuiin type-1 repeat 

HSYRA20014760//von Wlllebrand factor type A domain 

HSYRA20015740//Giucosamine-6-phosphate isomerase 

HSYRA20016210//HesB-ili<e domain 
35 HSYRA20016310//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//PHD-finger//Zinc finger, 

C2H2 type//Phorboi esters/diacyiglyceroi binding domain (CI domain)//Zinc finger, G2H2 type 

IMR321 000C440//ATP1 G1/PLM/MAT8 famiiy//Eul<aryotic protein l<inase domain 

IMR3210001580//Extracellular linl< domain/ZLectin C-type domain 

IMR3210002420//KRABbox//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zinc finger, C2H2 
40 type/ZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 

IMR3210002660ZZCation efflux family 

IMR3220002230ZZFHA domainZZHIT family 

IMR3220003020ZZSrc homoiogy domain 2 

iMR3220007420ZZZinc finger, C2H2 type 
45 iMR3220007750ZZNerve growth factor famiiyZZKazai-type serine protease inhibitor domalnZZThyroglobulin type-1 re- 

peatZZEF handZZimmunoglobulin domainZZImmunoglobulin domain 

iMR32200083B0ZZFomnyi transferase 

iMR3220009190ZZinfiuenza IVIatrix protein (i\/l1)ZZmetailopeptidase famiiy M24 
IMR3220009730ZZKinesin motor domain 
50 IMR322001 21 SOZZtRNA pseudouridine synthase 

IMR322001 31 ZOZZDuai specificity phosphatase, cataiytic domain 

KIDNE1 0001 040ZZMyb-lil<e DNA-binding domainZZApoiipoprotein A1ZA4ZE famiiyZZThymidyiate l(inaseZZSNAP-25 fami- 

lyZZSyntaxin 

KIDNE20000410ZZAminotransferases class-Ill pyridoxal-phosphate 
55 KIDNE2000051 OZZZinc finger, C2H2 typeZZZinc finger, G2H2 type//Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZANI -iike Zinc fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZPHD- 
fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
KiDNE20001670ZZSugar (and other) transporter 
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KIDNE20003150//Major intrinsic protein 
KIDNE20003300//DnaJ domain 

KIDNE20003490//Ubiquitin famiiy/ZViral matrix protein//Src liomology domain 2//Acyltransferase 
KIDNE20003750//G2 domain 
5 KIDNE20004030//RNAheiicase 

KIDNE20004970//Kinesin motor domain//K-box region 

KIDNE20005130//Aminotransferases ciass-ili pyridoxai-phosphate/ZAminotransferases ciass-ili pyridoxai-phosphate 
KIDNE20005170//Uncharacterized membrane protein family UPF0013 
KIDNE20031850//Ras association (RalGDS/AF-6) domain 
10 KIDNE20033050//Amidase//Amidase 

KIDNE20033730//SH3 domain/ZRhoGEF domain//PH domain 

KiDNE20039940//DNAgyrase/topoisomerase iV, subunit A//SCAN domain/ZZinc finger, C2H2 type//Zinc finger, C2H2 
type 

KIDNE20040840//Eul<aryotic protein l<inasedomain//Phosphoribuiol<inase//Myosinhead(motordomain)//i\^yosin head 
15 (motor domain) 

KIDNE20043440//Ribosomal protein L36 

KIDNE20044110//Viral methyltransferase//V-type ATPase 116l<Da subunit family 
KIDNE20046810//Dlenelactone hydrolase family 

KIDNE20048280//Amlno acid permease//Sodlum:neurotransmitter symporter family KIDNE20050420//Herpesvlrus 
20 UL25 famliy//Beige/BEACH domain//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta re- 
peat//WD domain, G-beta repeat//WD domain, G-beta repeat//AN1-like Zinc finger/ZFYVE zinc finger 
KiDNE20052960//Actin 

KiDNE20054770//Transmembrane amino acid transporter proteln//ion transport protein/ZAmino acid permease 
KiDNE20056290//Acetyitransferase (GNAT) famiiy 
25 KiDNE20056760//Gaiponin homoiogy (CH) domain 

KIDNE20059080//Armadiilo/beta-catenin-iii<e repeats//Armadilio/beta-catenin-iil<e repeats//Armadiilo/beta-catenin- 

like repeats 

KIDNE20060140//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 

G-beta repeat//WD domain, G-beta repeat 
30 KIDNE20060300//MutT-lll<e domain 

KIDNE20060530//Giycosyl transferase famiiy 8 

KIDNE20061490//SPRY domain 

KIDNE20062480//Scorplon short toxins 

KIDNE20062990//PH domain 
35 KiDNE20066520//Bacteriai extraceliuiar soiute-binding proteins, famiiy 5 

KIDNE20067600//lmmunogiobuiin domain/Zlmmunogiobuiin domain 

KIDNE20073520//WW domain 

KIDNE20075690//PMP-22/EMP/MP20/Claudln family 
KIDNE20078100//Ribosomai protein L15//lntegrase core domainZ/dUTPase 
40 KIDN E200781 1 0//KRAB box//Zlnc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//PHD-flnger//Zlnc finger, C2H2 type LIVER10000670//Urocanase 
LIVER1 0001 040//AMP-blndlng enzyme 

LIVER10002300//Respiratory-chain NADH dehydrogenase 51 Kd subunit 

LIVER1 0004330//Cyclic nucleotide-binding domain/ZGiutathione S-transferasesy/Uncharacterized protein famiiy 
45 UPF0028 

LIVER10005420//Bowman-Birk serine protease inhibitor famiiy 

LIVER20000330//Peptldase famiiy M1//K+ channei tetramerisation domain 

LIVER20000370//lmmunoglobulin domain/Zimmunogiobulin domain/Zimmunoglobulin domain 

MAMGL10000560//K-box region 
50 MAMGL1 0001 780//Lumenal portion of Cytochrome b559, alpha (gene psbE) subunit. 

MAMGL10001820//DIX domain 

MESAN10000350//Neurohypopliyslal hormones, C-terminal Domain 
M ESAN 1 0001 800//Sterol 0-acyltransferase 

MESAN20000920//SAM domain (Steriie alpha motif)//PDZ domain (Aiso known as DHR orGLGF).//Phosphatidyiinosi- 

55 tol 3- and 4-kinases 

MESAN20005010//PWWP domain 

NB9N410000470//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat 
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NB9N410001350//lactate/malate dehydrogenase//Ras family 
NB9N420001040//Na+/K+ ATPase C-terminus 

NHNPC10000840//RNA recognition motif, (a.k.a. RRIVI, RBD, or RNP domain)//RNA recognition motif, (a.k.a. RRM, 

RBD, or RNP domain) 
5 NHNPG20002060//DnaJ domain 

NHNPC20002120//Zinc finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type/ZPutative zinc finger in N- 

recognin/ZZinc finger, C2H2 type/ZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger 

C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 

NT2NE1 00001 40ZZZinc l<nucl<ieZZNucieotidyitransferase domainZZZinc knucl<ie 
10 NT2NE10000730ZZLeucine Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich Repeat 

NT2NE10000830ZZ7 transmembrane receptor (rhodopsin family) 

NT2NE1 0001 850ZZDIvalent cation transporter/ZTPR Domaln/ZTPR DomalnZZTPR Domaln/ZTPR Domaln/ZTPR DomainZ/ 
TPR DomalnZZTPR DomalnZZTPR DomalnZZTPR DomalnZZTPR Domain 
NT2NE20001740ZZRNA pseudouridylate synthase 
15 NT2NE200021 40ZZRhodanese-iike domainZZProtein-tyrosine phosphataseZZDual specificity phosphatase, catalytic do- 
main 

NT2NE20002590ZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc fingerZZZInc 
finger C2H2 type 

NT2NE20003690ZZCarbamoyl-phosphate synthase (CPSase) 
20 NT2NE20003840ZZTPR DomalnZZTPR DomainZZTPR Domain 
NT2NE20005360ZZRIbosomal protein S2 

NT2NE20005500ZZRetrovlral aspartyl proteaseZZRetroviral aspartyi protease 

NT2NE200065B0ZZZInc finger, C3HC4 type (RING finger)ZZICE-llke protease (caspase) p10 domaln/ZSPRY domain 
NT2NE20007630ZZMatrix protein (MA), pi 5 
25 NT2NE20008090Z/KRABboxZ/Zlncflnger, C2H2typeZ/Zlncfinger, C2H2type/ZZInc finger, C2H2type//Zincflnger, C2H2 
typeZZZinc finger, C2H2 type/ZPHD-flngerZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
NT2NE20013370ZZSPRY domain 

NT2NE2001 3720ZZTryptophan synthase alpha chalnZZRibulose-phosphate 3 eplmerasefamllyZZIndole-3-glycerol phos- 

30 phate synthases 

NT2NE2001 6260ZZ7 transmembrane receptor (rhodopsin family) 
NT2NE200i6660ZZDEADZDEAH box helicase 

NT2NE200340BOZZEGF-llke domainZZLamlnIn EGF-IIke (Domains III and V) 
NT2NE200471 BOZZGIycosyl transferase family 8 
35 NT2NE20053710ZZAnk repeat 

NT2NE20057200ZZUbiqultin-conjugatlng enzymeZZDNA mismatch repair proteins, mutS family 
NT2RI1 0000270ZZZinc finger C-x8-C-x5-C-x3-H type (and similar). 
NT2Ri10000480ZZDual specificity phosphatase, catalytic domain 

NT2Ri2000341 OZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
40 finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZDIVI DNA binding domainZZZinc finger, C2H2 typeZZ 
Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZPHD-fingerZZZinc finger, C2H2 type 
NT2Ri20004120ZZENTH domalnZZDNA polymerase (viral) C-termlnal domain 

NT2RI20004210ZZKRABboxZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2typeZ/Zlnc finger, C2H2 
45 typeZZZinc finger, C2H2 type 

NT2RI20006690ZZPIant thionlns 

NT2RI20006850ZZCollagen triple helix repeat (20 coples)ZZHIstone-llke transcription factor (CBFZNF-Y) and archaeal 
histone 

NT2RI2001 01 OOZZCarboxylesterasesZZCarboxylesterases 

50 NT2RI20010830ZZKRABboxZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTranscription factor S-li (TFiiS)ZZZinc fin- 
ger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
NT2Ri20014090ZZROKfamiiyZZSpectrin repeatZZTropomyoslnsZZSpectrln repeatZZSpectrIn repeat 

55 NT2Ri20014500ZZXyiose isomerase 
NT2Ri20015400ZZTPR Domain 
NT2RI20015950ZZKeratin, high sulfur B2 protein 
NT2RI20016210ZZBacterlal regulatory proteins, luxR family 
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NT2RI20016570//DnaJ central domain (4 repeats) 

NT2RI2001B460//Glutaminesynthetase//Notch (DSL) domain/ZNotch (DSL) domain 
NT2RI2001B660//lmmunoglobulin domain/ZSPRY domain 
NT2RI20020220//Phospliatidylinosltol-specific phospholipase C, X domain 
5 NT2RI20025170//PDZ domain (Also known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF). 
NT2RI20025300//Ublquitin family 

NT2RI20025410//Zinc finger C2H2type/yZlnc finger, C2H2type//Putative zinc finger in N-recognin//Zinc finger, C2H2 

type//Zlnc finger, C2H2type//Transcription factor S-ll (TFIIS)//Zinc finger, C2H2type//Zinc finger, C2H2 type//Zn-f inger 

Inublquitln-hydrolases and other proteims/ZZinc finger, C2H2type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
10 finger, C2H2 type 

NT2RI20025540//TPR Domain/ZTPR Domain/ZTPR Domain 

NT2Ri20025850//jmjN domain/ZimjC domain 

NT2Ri20029580//C2 domain//C2 domain 

NT2RI20029700//EF hand//EF hand 
15 NT2RI20030110//lmmunoglobulln domain 

NT2RI20031540//lnterleukln-6/G-CSF/MGF family 

NT2RI20032050//Armadlllo/beta-catenln-llke repeats//Armadillo/beta-catenin-llke repeats 

NT2RI20033440//PDZ domain (Also known as DHR or GLGF). 

NT2RI20036780//Subtilase famlly/ZProproteln convertase P-domain 
20 NT2RI20036950//Leucine Rich Repeat/ZLeucine Rich Repeat/ZLeucine Rich Repeat/ZLeuclne Rich Repeat/ZLeucine 

Rich RepeatZZLeucine Rich RepeatZZInsuiinZIGFZReiaxIn familyZZRibosomai RNA adenine dimethyiasesZZSAIWI domain 

(Steriie alpha motif)ZZTFiiE alpha subunitZZZinc finger, C3HC4 type (RiNG finger) 

NT2Ri2003751 OZZFonnamidopyrimidine-DNA giycosyiase 

NT2Ri20046060ZZK+ channei tetramerisation domain 
25 NT2Ri20049850ZZDomain of unknown function 

NT2RI20050610ZZPeptidase famiiy M1 

NT2RI20050870ZZVoltage gated chloride channeisZZXanthineZuracli perm eases famiiyZZSuifatetransporterfamiiyZZSTAS 

domain 

NT2RI200515QO//Slalyltransferase famllyZ/Photosynthetic reaction center protein 
30 NT2RI20053680/ZZInc finger, C2H2 type 

NT2RI20055640//Glutathlone S-transferases.ZZProtein of unknown function DUF61ZZGIutathione S-transf erases. 
NT2RI20056470/ZbZIP transcription factorZZTransposaseZZbZiP transcription factorZZOuter membrane effiux proteinZZ 
Intermediate filament proteins 

NT2RI20058110ZZGuanine nucieotide exchange factor for Ras-iike GTPases; N-tenninai motifZZRasGEF domain 
35 NT2Ri2006071 OZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc f ingerZZZinc 

finger, C2H2 typeZZZinc finger, C2H2 type 

NT2RI20062100//Src homology domain 2 

NT2RI20064120/ZRas famlly/ZCeil division protein 

NT2RI20066790//lmmunoglobulln domain 
40 NT2RI20067030/ZL1 (late) protein 

NT2RI20067350//Zlncflnger, G2H2typeZZZincfinger, G2H2typeZZZincfinger, C2H2 typeZZZinc finger, G2H2 typeZZTRAF- 

type zinc fingerZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc 

finger, G2H2 type 

NT2Ri20068250ZZDolichyi-phosphate-mannose-protein mannosyitransferaseZZS-adenosyimethionine synthetase 
45 NT2Ri20068550ZZHelicases conserved G-terminal domain 
NT2Ri20070480ZZAtrlal natriuretic peptide 
NT2Ri20070840ZZimmunogiobuiin domain 

NT2Ri20070960ZZHydroxymethyigiutaryi-coenzyme A reductaseZ/RhoGEF domainZZHpt domalnZZPH domain 
NT2RI20071330ZZKRAB boxZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 
50 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZTranscriptlon factor S-11 (TFIiS)ZZZinc 
finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc 
finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, G2H2 typeZZZinc 
finger, C2H2 type 

NT2RI20071480ZZWD domain, G-beta repeatZ/WD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, 
55 G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeat 

NT2RI20072540ZZRIbosomal RNA adenine dimethyiasesZZSAiVI domain (Steriie alpha motif)ZZTFI IE alpha subunitZZZinc 
finger, G3HC4 type (RiNG finger) 
NT2Ri20073840ZZEukaryotic protein kinase domain 
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NT2RI20074390//Zinc finger, C2H2 type//Zinc finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc 
finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zlnc finger, C2H2type//Zlnc finger, C2H2 type//Zlnc 
finger C2H2type//Zinc finger C2H2 type//Zlnc finger C2H2 type NT2RI20074690//Ublqulnol-cytochromeC reductase 
compiex 14i<D subunit 
5 NT2Ri20074980//Fz domain/ZZinc carboxypeptidase/ZZinc carboxypeptidase 
NT2Ri20078270//Acyl-CoA oxidase 

NT2Ri20078840//Homeobox domain/ZBacterial regulatory proteins, crp famliy/ZSIte-speclfic recomblnases/ZBacterial 
regulatory proteins, iuxRfamiiy 

NT2Ri20078910ZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, 
10 G-beta repeatZZWD domain, G-beta repeat 

NT2RI20080500ZZImmunoglobulln domalnZZImmunoglobulln domalnZZImmunoglobulln domalnZZImmunoglobulin do- 

mainZZImmunoglobuiln domainZZImmunogiobuiln domain 

NT2RI20083360ZZbZIP transcription factor 

NT2Ri2008481 OZZAcyltransferase 
15 NT2Ri200859aOZZBacteriai regulatory proteins, crp famllyZZCUB domalnZZF5Z8 type C domain 

NT2Ri20087140ZZSET domain 

NT2Ri200881 20ZZBindinZZHupFZHypC family 

NT2Ri20089420ZZimmunogiobuiin domainZZPKD domain 

NT2Ri2009Q650ZZZinc finger C2H2 typeZZZinc finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger C2H2 typeZZZinc 
20 finger C3HC4 type (RING finger)ZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C3HC4 type (RING 

f inger)ZZZinc finger, C2H2 type/ZPutatlve zinc finger In N-recognln/ZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 

finger C2H2 type 

NT2RI20091440ZZSPRY domain 

NT2RI20092150ZZSCAN domaln/Zlntegrase core domain 
25 NT2RI20092890ZZLeuclne rich repeat N-termlnal domalnZ/Leuclne Rich RepeatZ/Leucine Rich RepeatZZLeuclne Rich 

RepeatZZLeuolne Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich Re- 
peatZZLeuclne Ricli RepeatZZLeuclne Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich RepeatZZLeuclne rich repeat C- 

terminal domalnZZImmunoglobulin domain 

NT2RI20094060ZZDHHC zinc finger domain 
30 NT2RP60000320ZZCytochrome c oxidase subunit IIIZZ7 transmembrane receptor (Secretin family)ZZDomain found in 

Dishevelled, Egl-10, and Pleckstrin 

NT2RP60000720ZZi\/loiiuscan rhodopsin C-termlnal tali 

NT2RP60000860ZZUbiqultin-conjugatlng enzyme 

NT2RP60001000ZZKRABboxZ/Zlncflnger,C2H2typeZZZIncfinger,C2H2typeZZZIncflnger,C2H2 typeZZZinc finger, C2H2 
35 type/ZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc fInger/ZZInc finger, C2H2 type/ZZInc finger, G2H2 
typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
NT2RP60001 090ZZBTBZPOZ domainZZHIVIG (high mobility group) boxZZKelch motifZZKelch motlfZZKeich motifZZKelch mo- 
tifZZKelch molir 

NT2RP60001230/ZTPR DomainZZTPR DomainZZTPR DomalnZZTPR DomalnZZPPR repeatZZTPR Domain 
40 NT2RP60001 270ZZZinc finger C2H2 typeZZZinc finger C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc fingerZZZInc 

finger C2H2 type/ZZinc finger C2H2 type 

NT2RP70000690ZZiVlethyi-CpG binding domain 

NT2RP70002380ZZBacteriai export proteins, family 3 

NT2RP70002710ZZZinc finger C2H2 typeZZZinc finger, C2H2 type 
45 NT2RP70004770ZZTPR DomainZ/TPR DomalnZZTPR DomalnZZTPR DomalnZZTPR DomainZZTPR DomalnZZTPR Domain 

NT2RP70006240ZZIntegrlns, beta chalnZZRhoGEF domalnZZPH domainZZMORN motlfZZMORN motlfZZMORN motlfZZ 

MORN motlfZZMORN motifZZGoproporphyrinogen III oxidaseZZMORN motif 

NT2RP70010800ZZZAP domain 

NT2RP70011660ZZIronZmanganese superoxide dismutases (S0DM)ZZE1-E2 ATPaseZZDomaIn of unknown function 

50 DUF19ZZPhotosystem 11 reaction centre T protein 

NT2RP70012310ZZAIphavirus E3 giycoprotelnZZRNA recognition motif, (aA.a. RRM, RBD, or RNP domain) 
NT2RP70015910ZZKringie domainZZWSC domalnZZCUB domain 

NT2RP7001 8560ZZSAM domain (Sterile alpha motlf)ZZSterlle alpha motif (SAiVI)ZPointed domain 
NT2RP70023760ZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, 
55 G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZTBG domain 

NT2RP70024500ZZPicornavirus coat protein (VP4)ZZKRAB boxZZZInc finger C2H2 typeZZZinc finger C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger C2H2 typeZZIBR domalnZZZinc finger C2H2 typeZZZinc finger C2H2 type 
NT2RP70028750ZZAnk repeatZZAnk repeatZZAnk repeatZZCAP-Gly domain 
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NT2RP70029060//Oxysterol-binding protein//Hsp90 protein 
NT2RP70030550//PHD-finger 

NT2RP70032030//KRABbox//Zinc finger, C2H2type//Zincfinger, C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2 
type//Zinc finger, C2H2type//Zinc finger C2H2type//Zinc finger, C2H2type//Zinc finger, C2H2type//GATA zinc finger// 
5 Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

NT2RP70033040//Rhodanese-lil<e domain/Zlntegrase Zinc binding domain/Zintegrase Zinc binding domain//DnaJ cen- 
tral domain (4 repeats) 

NT2RP70036290//Glypican//Leucine Ricin Repeat/ZLeucine Rich Repeat 
NT2RP70036470//Porplnobilinogen deaminase/ZGHIVIP l<inases putative ATP-binding proteins 
10 NT2RP70036800//l\/lethanol deiiydrogenase beta subunit//BTB/POZ domain/ZKelch motif/ZKeioh motifZ/Kelch motif// 
Kelch motif/ZKeich motifZ/Keich motif 
NT2RP70039600ZZCalpain inhibitor repeat 

NT2RP70042040/ZPHD-finger/ZFYVE zinc finger/ZZinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 
typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
15 NT2RP70042330//TPR Domain/ZTPR DomainZZTPR DomainZZTPR DomainZZTPR Domain/ZTPR DomainZZTPR Domain 
NT2RP70043960//SH3 domain//SH3 domain//SH3 domain//SH3 domain//SH3 domain 
NT2RP70045410//Helix-loop-lnelix DNA-binding domain 
NT2RP70046560//PHD-finger//PHD-finger 

NT2RP70046870//l\/acroplnage migration inlnibitory factor (l\/liF)ZZWD domain, G-beta repeatZ/WD domain, G-beta re- 
20 peatZ/WD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeatZ/Arenavirus glycoprotelnZZ 
WD domain, G-beta repeat 
NT2RP70049150ZZPWWP domain 
NT2RP70049250ZZWH1 domain 

NT2RP70055020ZZSigma-54 interaction domainZZATPases associated with various ceilular activities (AAA) 
25 NT2RP70055130ZZKRABboxZZZinc finger, C2H2typeZZZincflnger, C2H2 typeZZZinc finger, C2H2typeZZZIncfinger, G2H2 

typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc 

finger//Zinc finger. C2H2 typeZ/ZInc finger, C2H2 typeZ/DM DNA binding domain/ZZinc finger, G2H2 type 

NT2RP70061620//Zlnc finger C2H2type//Zlnc finger G2H2 typeZZZinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc 

finger C2H2 type//Zinc finger C2H2 type 
30 NT2RP70061 880//GTPase-activator protein for Ras-iike GTPase 

NT2RP70062950//SNF2 and others N-terminal domainZ/SNF2 and others N-terminai domainZZLelshmanoiysinZZHeii- 

cases conserved C-termlnal domain 

NT2RP70063040//Plant PEC family metallothionein/ZCell division protein 

NT2RP70064900//KRABbox//Zinc finger, C2H2typeZZZincfinger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
35 typeZZPHD-fingerZZZincfinger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2typeZ/Transcription factor S-ii (TFI- 
IS)ZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZ/Zinc finger C2H2 typeZZTRAF-type zinc fIngerZZZinc finger, C2H2 type 

NT2RP70065270//LIM domain containing proteinsZZLiiVI domain containing protelnsZZLiiVI domain containing proteins/Z 

40 Helper component proteinase 

NT2RP70069860//KRABbox/ZZInc finger C2H2type//Zinc finger C2H2typeZ/Zlnc finger C2H2type//Zinc finger C2H2 
type/ZZinc finger C2H2 type//Zinc finger C2H2type//Zinc finger C2H2type//Zinc finger C2H2type/ZZInc finger C2H2 
typeZ/ZInc finger C2H2 type//Zlnc finger G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 
typeZZPutative zinc finger in N-recogninZ/ZInc finger, C2H2 typeZZDM DNA binding domainZZZinc finger, C2H2 type 

45 NT2RP70071 770//STAT protein//Zlnc finger G3HC4 type (RiNG finger) 
NT2RP70072210//Vlral (Superfamiiy 1) RNA helicase 

NT2RP70072520//PAS domain//PAS domainZZPAS domainZZEulcaryotic protein Icinase domain 
NT2RP70074060ZZGiutamine synthetase 

NT2RP70075370ZZZinc finger, G3HC4 type (RiNG finger)ZZB-box zinc finger.ZZGONSTANS famiiy zinc fingerZZPutative 

50 zinc finger in N-recogninZZSPRY domain 

NT2RP70076100//SAM domain (Sterile alpha motif) 
NT2RP70076430//ApolipQproteln A1/A4/E family 

NT2RP70079250//F5/8 type C domaln//Lamlnln G domain/ZLaminin G domain//EGF-ill<e domainZ/Thrombospondin N- 
termlnal -Wke domainsZZLamlnin G domain 
55 NT2RP70079750//Lamlnln G domain 

NT2RP70081370//Herpesvirus glycoprotein l\/l//ABC transporter/ZRibosomal SI 7 
NT2RP70081440ZZEukaryotic protein kinase domain 
NT2RP70081670ZZHelix-hairpin-hellx motif .ZZS1 RNA binding domain 



80 



EP 1 293 569 A2 



NT2RP70084060//Glycosyl transferases group 1 

NT2RP70084410//BromodorTiain//Brornodomain//BrorTiodomairi//BrorTiodomain//Bromodornai n//Bromodomain//BAH 
domain 

NT2RP70084870//Sulfotransferase proteins 
5 NT2RP70085500//lmmunoglobulin domain/Zlmmunoglobulin domain/Zlmmunoglobulin domain/Zlmmunoglobulin do- 
main/Zlmmunoglobulin domain/ZFibronectin type III domainZ/Fibronectin type III domalnZZFIbronectin type III domain 

NT2RP70085570ZZHeavy-metal-associated domainZZHECT-domain (ubiquitin-transferase). 

NT2RP70087200ZZKRAB boxZZKRAB boxZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
10 finger C2H2 typeZZDM DNA binding domainZZZinc finger, C2H2 type/ZZinc finger, C2H2 typeZZZinc finger C2H2 typeZZ 
Zinc finger, C2H2 typeZZZinc finger, C2H2 type 
NT2RP70088550ZZPH domain 

NT2RP70090120ZZCytochrome oxidase subunit llZZVoltage gated chloride channels/ZCBS domainZZCBS domain 
NT2RP70090190ZZZinc finger, C2H2 typeZ/Zinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZ 
15 Transcription factor S-ll (TFIIS)ZZZinc finger, C2H2 type/ZZinc finger, C2H2 type/ZZinc finger, C2H2 typeZ/Zinc finger, 
C2H2 typeZZZinc finger, C2H2 type/ZZinc finger, C2H2 type/ZZinc finger, C2H2 typeZ/ZInc finger, C2H2type//Zlnc finger, 

C2H2 typeZZZinc finger, C2H2 type 
NT2RP70091490ZZSugar (and other) transporter 

NT2RP70092360ZZImmunoglobulin domainZZImmunoglobulin domainZ/lmmunoglobulin domainZZImmunoglobulln do- 
20 mainZZImmunoglobuiin domainZZImmunoglobulin domainZ/ThioredoxinZZImmunoglobulin domainZ/lmmunoglobulln do- 
main//lmmunoglobulln domain//Thloredoxln//lmmunoglobulin domaln//Adenovlrus E3 region protein CR1//lmmu- 
noglobulln domain//lmmunoglobulln domain//lmmunoglobulln domaln//lmmunoglobulln domain 
NT2RP70093220//CbiM//Voltage gated chloride channels//CBS domaln//CBS domain NT2RP70093700//WD domain, 
G-beta repeatZ/Vlrlon host shutoff protein//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G- 
25 beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 
NT2RP70093940//IPT/TIG domain 

NT2RP70094810//Myelin proteollpid protein (PLP or llpophllln)//lnfluenza non-structural protein (NS1)//Proteln of un- 
known function DUF67//Patched familyZZ7 transmembrane receptor (metabotropic glutamate family) 
NT2RP70094980ZZEGF-lil<e domainZZEGF-lil<e domainZZTrypsIn Inhibitor like cysteine rich domainZ/EGF-iike domain/Z 
30 EGF-like domainZ/Trypsin inhibitor like cysteine rich domaln//EGF-llke domaln//von Wlllebrand factor type C domain// 
von Willebrand factortype C domainZZMetallothionelnZZvon Wlllebrand factortype C domaln//von Wlllebrand factortype 
C domain//von Wlllebrand factor type C domaln//von Willebrand factor type C domain 
NTONG10000520//BTB/POZ domain//Kelch motif//Kelch motif 

NT0NG1 0001 300//HlyD family secretion proteln//Blopterln-dependent aromatic amino acid hydroxylaseZ/Caspase re- 
35 cruitment domain 

NTONG10002570//Rhabdovirus spike glycoprotein 

NTONG10002640ZZPhosphoglucomutaseZphosphomannomutase 

NTONG20003340ZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type/ZZinc finger, C2H2typeZ/Zinc 
finger C2H2 type/ZZinc finger, C2H2 type 
40 NTONG20008780ZZBacterial regulatory proteins, iaci familyZ/Site-speclfic recomblnases 

NTONG20009660ZZNebulin repeatZZNebulin repeatZ/Nebulin repeatZZNebulln repeatZZNebuiin repeatZZNebulin repeatZZ 
Nebuiin repeatZZNebulin repeat 

NTONG20015500ZZZInc finger, C2H2 typeZZZinc finger, C2H2 type//Zinc finger, C2H2 type//Zlnc finger, C2H2 type// 
BolA-IIke protein/ZZinc finger, C2H2 type//Zinc finger, C2H2 type//Zlnc finger, C2H2 type//Zinc finger, C2H2 type//Tran- 
45 scriptlon factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zlnc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 
type 

NTONG20016120//PH domaln/ZPhosphoglycerate mutasefamlly//Oxysterol-blnding protein 

OGBBF1 000091 0//Sori3ln homologous domaln/ZPeptldase family M1//SH3 domaln//SH3 domain//SH3 domain 

OCBBF10001180//K+ channel tetramerlsation domain 

50 OCBBF1 0001 1 90//DNA topoisomerase II (N-terminal region) 

OGBBF10001220//BTB/POZ domaln/ZKelch motifZZKelch motifZZKelch motifZZKelch motlfZZKelch motif 
OCBBF20002310ZZLeucine rich repeat N-terminal domainZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeuclne Rich 
RepeatZZLeucine Rich RepeatZZLeuclne Rich RepeatZZLeuclne Rich Repeat/ZLeucine Rich RepeatZZLeucine Rich Re- 
peatZZLeucine Rich RepeatZZLeucine rich repeat C-terminal domain 

55 OGBBF20007190ZZMetallo-beta-lactamase superfamlly 

OCBBF20008240ZZbZIP transcription factorZ/tRNA synthetase class II (G, H, p S and T) 
OCBBF20010750//Spectrln repeat 

OCBBF20011010//Zinc finger, C2H2 type//Zlnc finger, C2H2 type//PHD-finger//Zinc finger, C2H2 type//Zinc finger, 
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C2H2 type/yZinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//FYVE zinc 
finger//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Transcription 
factor S-ll (TFIIS)//Zinc finger, C2H2 type//Zinc finger, C2H2type 
OCBBF20011240//Glutathione S-transf erases. 
5 OCBBF20011400//WD domain, G-beta repeat//K+ channel tetramerisation domain//7-fold repeat in Clatlirin and VPS 
OCBBF20011760//BTB/POZ domain/ZKelch motif//Keich motif/ZKeich motif/ZKelch motif/ZKeich motif 
OCBBF20012100//PAP2 superfamily 

OCBBF20013070//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
OCBBF20014940//UBA domain 
10 OCBBF20015270//Zinc finger, C2H2 type/ZBacterial type II secretion system protein 
OCBBF2001 5280//lactate/malate dehydrogenase 
OCBBF20015860//ATP synthase Alpha chain, C terminal 

PEBLI\/I10000340//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domainy/Zn-finger in Ran binding protein and 

others. 

15 PEBLM10000680//Actin 

PEBLM20001120/rrhymidylate kinase/ZLeucine Rich Repeat/ZLeucine Rich Repeat/ZLeucine Rich RepeatZZLeucine 

Rich Repeat/ZLeucine Rich Repeat 

PEBLM20002480//KRAB boxZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZTRAF-type zinc 
finger/ZZinc finger, C2H2 typeZZTranscription factor S-ll (TFIIS)ZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZZZinc 
20 finger C2H2 type 

PEBLM20002700Z/KRABboxZZZincfinger,C2H2 typeZZZinc finger, C2H2typeZZZincfinger,C2H2typeZZZincfinger,C2H2 
typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type/ZPHD-finger/ZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 

PEBLM20003080ZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
25 finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger C2H2 typeZZZinc finger, C2H2 type 
PEBLI\/I20003950ZZSCAN domain 

PEBLI\/l20004790ZZSrc homology domain 2//Eukaryotic protein kinase domain 
PLACE50000370ZZ7-fold repeat In Clathrin and VPS 
30 PLACE50000580ZZApolipoproteln A1ZA4/E family 

PLACE500006B0//Sushi domain (SCR repeat)Z/Sushi domain (SCR repeat) 

PLACE60002050ZZZinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 type 
PLACE60003790ZZLumenal portion of Cytochrome b559, alpha (gene psbE) subunit. PLACE60004290ZZGag P30 core 
shell protein 
35 PLACE60012810ZZAIVIP-blndlng enzyme 

PLACE6001 4430ZZmoaA Z nifB Z pqqE familyZZIVIoaC family 
PLA0E6001 8860ZZAdenylate and Guanylate cyclase catalytic domain 

PLACE60021020/Zlntegrase Zinc binding domain/Zlntegrase Zinc binding domain/ZDnaJ central domain (4 repeats) 
PLAGE60021510/ZKRAB boxZZZinc finger C2H2 typeZ/ZInc finger C2H2 type/ZPutative zinc finger in N-recognin/ZZinc 
40 finger C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 typeZ/Zinc finger, C2H2 typeZ/Zinc finger, C2H2type/ZZinc 
finger C2H2 type/ZZinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 typeZ/Zinc finger, C2H2 type 
PLACE60026680/ZSorbin liomologous domain//SH3 domain//SH3 domain 

PLACE60030380/ZZinc finger, C2H2type//Zinc finger, G2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc 
finger, C2H2 typeZZZinc finger, G2H2 type//Zinc finger, C2H2 type 
45 PLACE60032040/ZHirudin 

PLACE60037050ZZENV polyprotein (coat polyprotein) 

PLACE60038500ZZMItochondrial carrierproteinsZZMItochondrial carrier proteins PLAGE60044640ZZSmal I cytokines (in- 
tecrlneZchemoklne), interleukin-8 like 

PLAGE60046630ZZPhorbol estersZdIacylglycerol binding domain (G1 domain)ZZPHD-finger 
50 PROST10003430ZZPHD-fingerZZZinc finger, G3HC4 type (RING finger)ZZTRAF-type zinc fingerZZPDZ domain (Also 
known as DHR or GLGF).//PDZ domain (Also known as DHR or GLGF),/ZPDZ domain (Also known as DHR or GLGF).// 
WHEP-TRS domain containing proteinsZZPDZ domain (Also known as DHR or GLGF). 

PROST10005360ZZF5/8type Cdomain/ZLaminin G domainZ/Laminin G domain//EGF-iike domainZZFibrinogen beta and 
gamma chains, C-terminai giobuiar domain 
55 PROST20003250ZZRNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 

PROST2001 8230ZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, G2H2 type 
PROST2001 8990ZZADP-ribosylation factor familyZZRas family 
PROST20023380ZZK+ channel tetramerisation domalnZZBTBZPOZ domain 
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PROST20029600//Small cytokines (intecrine/chemokine), interleukin-8 like//lmmunoglobulin domain 
PROST20031 1 70//Heavy-metal-associated domain//HECT-domain (ubiquitin-transferase). 
PROST20033380/n"PR Domain/ZTPR Domain//TPR Domain 
PROST20033400//Eukaryotic protein kinase domain 
5 PROST20043320//Paramyxovirus nucleocapsid protein//SH3 domain 
PROST200441 60/AT ropomyosins 

PROST20051 21 0//Protein phosplnatase 2C//Protein phosphatase 2C 
PROST20064500//Sulfotransferase proteins 

PROST20067370//TRAF-type zinc finger//DnaJ central domain (4 repeats) 
10 PROST20069880//Atrial natriuretic peptide 

PROST20072890//K+ channel tetramerisation domain//BTB/POZ domain 

PROST20073170//K+ channel tetramerisation domain//BTB/POZ domain//Zinc finger, C2H2 type//Zinc finger, C2H2 
type//Zinc finger present in dystrophin, CBP/p300 
PROST20073890//Platelet-derived growth factor (PDGF) 
'5 PROST200851 6Q//Tropomyosins//Tropomyosins 
PROST20094830//PH domain 

PUAEN100Q3220//Photosystem I reaction centre subunit Viii 
SALGL10000050//Permeases for cytosine/purines, uracii, thiamine, allantoin 
SALGLIOOOOeSO/ySAM domain (Sterile alpha motif)//Sterile aipha motif (SAM)/Pointed domain 
20 SALGL1 0001 570//Collcin pore forming domain/ZiVlotA/ToiQ/ExbB proton channel family 

SKMUS10000140//Ubiquitinfamily//Ubiquitinfamily//Ubiquitinfamily//Ubiquitinfamiiy//Ubiquitinfamiiy//Ublquitinfamliy 
SKMUS10000220//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat//WD domain, 
G-beta repeat//WD domain, G-beta repeat//WD domain, G-beta repeat 

SKMUS1 0000640//Zlnc finger, C3HC4 type (RING fingery/Zinc finger, C3HC4 type (RING finger)//PHD-flnger//B-box 
25 zinc f inger.//3'5'-cyciic nucleotide phosphodiesterase 
SKMUS10001040//bZiP transcription factor 
SKMUS10001180//Coronavirus S2 glycoprotein 

SKMUS10001290//MutT-like domain 

SKMUS1 0001 770//Protein-L-isoaspartate(D-aspartate) 0-methyltransferase (PCMT) SKMUS20000740//ublE/COQ5 
30 methyltransferase family//Cyclopropane-fatty-acyl-phospholipid synthase 
SKMUS20001170//ATP synthase Alpha chain, C terminal/ZMAGE family 
SKMUS20002710//Hepatltls C virus capsid protein 
SKMUS20003900//IVIOV34/MPN/PAD-1 family 

SKMUS20004580//LII^ domain containing proteins/ZNebuiin repeat/ZNebuiin repeat/ZNebuiin repeat/ZNebulin repeat// 
35 Nebulin repeat/ZNebuiin repeat/ZNebuiin repeat/ZNebuiin repeat/ZNebuiin repeat 

SKMUS20007240//Thiamine pyrophosphate enzymes/ZThiamine pyrophosphate enzymes/ZThiamine pyrophosphate 

enzymes 

SKMUS20008630//OB-fold nucleic acid binding domaln/ARNA synthetases class 11 (l=)//tRNA synthetases class II (D, 

Kand N) 

40 SKMUS20009540//F-box domain. 

SKMUS20011290//lron-containing alcohol dehydrogenases/Ziron-containing alcohol dehydrogenases 
SKMUS20013640//Laminin EGF-like (Domains III and V) 
SKMUS20016340//HI\/1G (high mobility group) box 

SKMUS20016620//Ank repeat//Ank repeat/ZGIutamine amidotransferases ciass-il//Ank repeat 
45 SKMUS2001 6680//Phorbol esters/diacyigiyceroi binding domain (C1 domaln)//CONSTANS family zinc f inger//SH3 do- 
main 

SKNMC10000290//Zinc finger C-x8-C-x5-C-x3-H type (and similar). 

SKNMC1 000251 0//ABC transporter transmembrane reglon.//Phosphoribulokinase//ATPases associated with various 
cellular activities (AAA)//ABC transporter 
50 SKNMC20000650//Zinc finger, C2H2 typeZ/Protein phosphatase 2C//Zinc finger, C2H2 type//Zinc finger, C2H2 type// 
Zinc finger, C2H2 type//Zinc finger C2H2 type//Zinc finger, C2H2 type 

SKNMC20000970//RNA recognition motif, (a.k.a. RRM, RBD, or RNP domain)//Elongation factor TS//Protein-L-iso- 
aspartate(D-aspartate) 0-methyltransferase (PCMT)//I^et-10+ like-proteins 

SKNMC20002240//KRAB box//Zinc finger, C2H2 type//LIM domain containing proteins/ZZincfinger, C2H2 type//TRAF- 
55 type zinc finger/ZZinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//LIM 
domain containing proteins//PHD-finger//Zinc finger, C2H2type 
SKNiVIC20003560//Helix-ioop-helix DNA-binding domain 
SKNMC20010570//F-box domain. 
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SKNMC20015030//Keratin, high sulfur B2 protein 

SKNMC2001 5960//Anl< repeat//Ani< repeat//Ank repeat//Anl< repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat//Ank repeat/ZBacterial stress protein//Ank re- 
peat//Formylmethanofuran — tetrahydromethanopterin formyltransferase//Ank repeat//Ank repeat//Ank repeat//Neu- 
raxin and IVIAPI B proteins//FYVE zinc finger 
SKNSH1 0001 740//Pyridoxai-dependent decarboxylase 
SKNSH1 000301 0//SH3 domain 

SKNSH20003470//Heme-binding domain in cytochrome b5 and oxidoreductases 
SiVliNTI 00001 60//UDP-giucoronQsyi and UDP-giucosyl transferases 

SiVliNTI 0000420//Cytochrome oxidase subunit II//ABC transporter/ZBiopterin-dependent aromatic amino acid hydrox- 
ylase 

SMINT10000570//lmmunoglobulin domalnZ/lmmunoglobulin domaln//lmmunoglobulln domain 
SMINT1 000071 0//Immunoglobulin domain 

SiVliNTI 0001030//Ank repeat//Ank repeat//Ank repeat//Ank r6peat//Ank repeat//Ank repeat//Ank repeat//Ank repeat// 
Ank repeat//Ank repeat 

Si\/liNT20002270//Disintegrin//Trans-activation protein X 

SiVliNT20002770//Transcriptional regulatory protein, C terminal//lmmunogiobuiln domain 
SPLEN10001430//HiVIG (high mobility group) box 

SPLEN20000720//Zinc finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//TRAF-type zinc finger/ZZInc 
finger C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 
SPLEN20001340//Peptidase family M20/IVI25/i\/l40 
SPLEN20001970//Transcrlption factor TFIIB repeat 
SPLEN20002670//WD domain, G-beta repeat 

SPLEN20003570//RasGEF domaln//Ras association (RalGDS/AF-6) domain 

STOMA10001860//Cytosollc long-chain acyi-CoAthloester hydrolase//OB-fold nucleic acid binding domain//Cytosoilc 
long-chain acyl-CoA thioester hydrolase 
STOi\yiA20000880//lmmunoglobulln domain 

STOi\yiA20001210//Cys/i\/let metabolism PLP-dependent enzyme//Amlnotransferases class-l 

STOi\yiA20002570//iVlgtC family 
STOiVlA20002890//Adaptin N terminal region 

STOi\/IA20003960//LliVI domain containing protelns//LIM domain containing proteins STOMA20004820//PH domain// 
EF hand//EF hand//Phosphatidyiinositol-speciflc phospholipase C, X domain 
SYNOV10001280//Llpoate-protein llgase B 

SYNOV20013740//KRAB box//Bacterlal type II secretion system protein l/J//Zlnc finger, C2H2 type//Zlnc finger, C2H2 
type//Zlnc finger, C2H2 type//Zlnc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zlnc finger, C2H2 
type//Zinc finger, C2H2 type//Zlnc finger, C2H2 type 

SYNOV2001451 0//SRF-type transcription factor (DNA-blndIng and dimerisatlon domain) 
SYNOV2001 6480//giycosyi transferase family 
TESTi10000420//K-box region/ZPenicillln amidase 
TESTil 000051 0/zTransient receptor 
TESTi10000550//Homeobox domain 

TESTil 0000640//K+ channel tetramerlsation domain//BTB/POZ domaln//Kelch motlf/ZKelch motlf//Kelch motif//Kelch 

motif/ZKeich motif 

TESTil 0000700/ZUblqultln carboxyl-terminal hydrolases family 2/ZUbiqultln carboxyl-terminal hydrolase family 2 
TESTil 0001 270//PLAT/LH2 domain//PLAT/LH2 domalnZZPLATZLH2 domain 
TESTI10001380/ZSubtllase familyZZProprotein convertase P-domaIn 

TESTil 0001 680/ZLeuclne Rich Repeat/ZLeuclne Rich Repeat/ZLeuclne Rich Repeat/ZLeucine Rich RepeatZZLeucine 
Rich RepeatZ/Leuclne Rich Repeat 
TESTI20001200/ZKRAB box 
TESTI20001540ZZEukaryotic protein kinase domain 

TESTI20001770//von Wiliebrand factor type A domainZZProproteln convertase P-domaIn 
TESTI20002070/ZNifU-iike N terminal domain 
TESTI20002380/ZExonuciease/Z3'-5' exonuclease 
TESTi20002530/ZUbiquitln family 
TESTI20003560/ZTubulinZFtsZ family 

TESTI20005910ZZAdenyiate kInaseZZEIongatlon factor Tu famllyZZAdenylate kinase//6-phosphofructo-2-klnase//Shiki- 
mate kInaseZZpKID domalnZZAdenylate klnaseZZThymldylate kInaseZZATPases associated with various cellular activities 
(AAA) 
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TESTI20006000//Ank repeat//CAP-Gly domain 

TESTI20006270//TPR Domain//TPR Domain//TPR Domain//4-hyclroxyphenylpyruvate dioxygenase C terminal do- 

main//TPR Domain//TPR Domain 

TESTI20006950//Tudor domainZ/Stathmin family 
5 TESTI20006990/yKOW motif/ZKinesin motor domain 

TESTI20007070//DIVI DNA binding domain 

TESTI20007840//Apollpoproteln A1/A4/E family 

TESTI20008490//Apollpoproteln A1/A4/E family 

TESTI20008830//lmmunoglobulln domain 
10 TESTI20010490//KRABbox//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2type//Zlnc finger, C2H2 

type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zlnc finger, C2H2 type//Zlnc finger, C2H2 type//Zinc finger, C2H2 

type//Zinc finger, C2H2 type//Zinc finger, C2H2 type//Zinc finger, C2H2 type 

TESTi2001 1 41 0//RhoGEF domain//PH domain/ZPhorboi esters/diacyigiyceroi binding domain (C1 domain)//CXXC zinc 
flnger//FYVE zincfinger//PH domain 
'5 TESTI20012370//K+ channei tetramerisation domain//BTB/POZ domain/ZOrnlthlne decarboxylase antlzyme/ZKelch 
motlfZ/Kelch motlf/ZKelch motif/ZKeich motif/ZKeich motif 

TESTI20012690ZZBIotin-requlring enzymesZZBiotin-requiring enzymesZ/2-oxo acid dehydrogenases acyltransferase 
(catalytic domain) 

TESTI20013300ZZEF handZZEF handZZUblqultin carboxyl-terminal hydrolases family 2 
20 TESTI2001 3450ZZDouble-stranded RNA binding motifZZAideliyde oxidase and xanthine dehydrogenase, C termlnusZZ 
Adenosine-deaminase (editase) domain 

TESTi2001 4200ZZABC 3 transport famiiyZZSugar (and other) transporter 
TESTi20015110ZZbZiP transcription factorZZTroponinZZDomain of unl<nown function DUF87 
TESTi2001 5560ZZK+ channei tetramerisation domalnZZBTBZPOZ domain 
25 TESTi20016610ZZLeptin 

TESTI2001 8150ZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZinc 
finger C2H2 type 

TESTI2001 8270/ZTransketolase//Dehydrogenase E1 component/ZTransketolase 

TESTI2001 8520/ZF5Z8 type C domalnZ/LamlnIn G domainZ/Ribosomal protein LUZZThrombospondln N-termlnal -like 
30 domainsZ/LamlnIn G domalnZZEGF-IIke domalnZ/Flbrinogen beta and gamma chains, C-termlnal globular domain 
TESTI2001 8690ZZSAM domain (Sterile alpha motif) 

TESTI2001 8790ZZKRAB boxZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZInc finger, C2H2 
typeZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZInc finger, C2H2 typeZZZInc finger, C2H2 typeZZZinc finger, C2H2 
type 

35 TESTi20020570ZZE1 Protein, N terminai domainZZActin 

TESTI2002081 0ZZ7 transmembrane receptor (metabotropic giutamate famliy)ZZTransmembrane amino acid transporter 
protein 

TESTI20021050Z/TPR DomalnZZTPR DomalnZZTPR DomainZZTPR DomainZZTPR DomainZZTPR DomalnZZTPR Domain 
TESTI20021490ZZBTBZPOZ domalnZZZinc finger, C2H2 typeZZPHD-fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 

40 typeZZZInc finger, C2H2 typeZ/ZInc finger, C2H2 type 

TESTI20022230/ZNucleosome assembly protein (NAP) 
TESTI2002251 OZZCalretlcuiln familyZ/PHD-flnger 

TESTI20022560ZZLeuclne Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeuclne Rich RepeatZZLeucine 
Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZLeucine Rich RepeatZZGuanylate kInaseZZVesicuiovirus 
45 phosphoprotein 

TESTi20024980ZZPDZ domain (Aiso known as DHR or GLGF).ZZSH3 domalnZZGuanyiate kinase 
TESTi20025160ZZIVIAGE famiiy 

TESTi20025800ZZiactateZmaiate dehydrogenaseZZPyrldlne nucieotide-dlsuiphide oxidoreductase 
TESTi20026760ZZSPRY domain 
50 TESTI20027070ZZType I phosphodiesterase Z nucleotide pyrophosphatase 
TESTI20027290ZZRhoGAP domain 

TESTI20027890ZZKRAB boxZZDIctyosteilum (slime mold) repeatsZZDictyostelium (silme moid) repeatsZZZInc finger, 
C2H2 typeZZDIctyostellum (slime moid) repeatsZZZInc finger, C2H2 typeZZDIctyostelium (slime mold) repeats 
TESTI20029120ZZEukaryotic protein kinase domain 
55 TESTI20030050ZZHIstone-like transcription factor (CBFZNF-Y) and archaeai histone 

TESTI20030370ZZMYND fIngerZZTPR DomainZZTPR DomainZZTPR DomainZZAdaptln N terminal region 
TESTi20030710ZZHerpesvirus UL25 famiiy 

TESTi20031090ZZAnnadiiioZbeta-catenln-iikerepeatsZZAmnadliioZbeta-catenin-iikerepeatsZZArmadiiloZbeta-catenin-iike 
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repeats//Armadillo/beta-catenin-like repeats//Armadillo/beta-catenin-like repeatsZ/Aimadillo/beta-catenin-like re- 
peats//Atmadillo/beta-catenin-like repeats 
TESTI20031300//TPR Domain 
TESTI20031520//mRNA capping enzyme 
5 TESTi20031 960//WD domain, G-beta repeat 
TESTi20032280//iVlyb-iike DNA-binding domain 

TESTi20033250//UBX domain/ZOrotidine 5'-phosphate decarboxylases 

TESTi20033270//DiVI DNA binding domain 

TESTi20033540//Zinc finger, C2H2 type 
10 TESTi20033560//F-box domain. 

TESTI200341 90//ATP synthase Alpha chain, C terminal//AMP-binding enzyme 

TESTI20034980//RhoGEF domain 

TESTI20035120//C2 domaln/ZKInesIn motor domain 

TESTI20035510//NOL1/NOP2/sun family 
15 TESTI20035890//UBA domain/ZZinc finger C-x8-C-x5-C-x3-H type (and similar). 

TESTi20036250//TSC-22/dip/bunfamily//NAD dependent epimerase/dehydratase family/ZAdenylate kinase/ZATPases 

associated with various ceiiuiar activities (AAA) 

TESTi20037810/ZEukaryotic protein kinase domain 

TESTi20038940/ZiQ caimoduiin-binding motifZZiQ calmodulin-binding motif//IO calmoduiin-binding motif 
20 TESTi20040000/Zshort chain dehydrogenase/Z3-beta hydroxysteroid dehydrogenaseZisomerase family 
TESTI20040310/ZProteln of unknown function DUF84 
TESTI20041220/ZVPR/VPX protein 
TESTI20042870/ZET module 

TESTI20042950/Z3'5'-cyclic nucieotlde phosphodlesterase/ZPeptldase family M1 

25 TESTI20049820//Cycilc nucleotide-binding domain 

TESTI20053960/ZKRAB boxZZZInc finger, C2H2 typeZ/ZInc finger, C2H2 type/ZZInc finger, C2H2 lypeZZZInc finger, C2H2 
typeZZTranscription factor S-ll (TFIIS)ZZZinc finger, C2H2 typeZZZinc finger, C2H2 type//Zinc finger C2H2 type/ZTRAF- 
type zinc fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZ/Zinc finger, C2H2 lypeZZZInc finger, C2H2 type/ZZInc 
finger C2H2typeZZZincfinger, C2H2type/ZPhorbol esters/diacylglyceroi binding domain (C1 domaln)/ZZinc finger, C2H2 

30 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
TESTI20055840ZZPH domainZZPH domain 

TESTI20056900/ZUrease, gamma subunlt/ZIQ calmodulln-blnding motlf//IQ calmodulin-binding motif 
TESTI2005731 0//Tropomyoslns 

TESTI20057420/ZAcyl CoA binding proteln/ZRIbosomal Proteins L2 
35 TESTI20064830/ZTetrahydrofolate dehydrogenase/cyclohydrolase 

TESTI20068660/ZDomaln of unknown function DUF19/ZTPR Domaln/ZTPR Domaln/ZTPR Domain 
TESTI20068720/ZZinc finger, C2H2 typeZZZinc finger, C2H2 type 

TESTi20074640//KRAB box/ZZinc finger, C2H2 typeZZiVIYND fingerZ/Zinc finger C2H2 typeZZZinc finger, C2H2 typeZZ 

Zinc finger, C2H2type/ZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2typeZZ 
40 Zinc finger, C2H2 typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 type TESTi20074660Z/KRAB box/ZZinc finger, C2H2 

typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2type//Zinc finger, C2H2 type//Zlnc finger, C2H2 

typeZZZinc finger, C2H2 typeZ/PHD-fingerZ/Zinc finger, C2H2 type//Zlnc finger, C2H2 type//Zlnc finger, C2H2type//Zlnc 

finger C2H2 typeZ/IVlYND finger//Zinc finger, C2H2type 

TESTI20074800//Glyplcan 
45 TESTI20077490//Slgnal peptidase (SPase) II 

TESTI20078640//SCAN domain 

TESTI20078720//ATP synthase BZB' CF(0)//Rlbosomal L29 protein TESTI2007951 0//Immunoglobulin domaln//immu- 
noglobulln domain//Adenovlnjs E3 region protein CR1//lmmunoglobulln domaln/Zlmmunoglobulln domaln/ZFibronectln 
type III domaln//Flbronectln type Mi domain 

50 TESTI20080200//i\ZlttB family UPF0032 

TESTi20080330/ZRibosomai protein L14p/L23e 
TESTi20083430//TPR Domain 

TESTi20083870/ZEF handZZEF handZ/EF handZ/Phosphatldyllnositol 3- and 4-klnasesZ/EF hand 
TESTi20086570//iVIAGE family 
55 TESTi20087740//TPR DomainZ/TPR Domaln/ZOuter membrane efflux proteln/ZTPR Domaln/ZTPR Domain 
TESTi201 38320//Transketolase 
TESTI20140360//metaliopeptldase family M24 

TEST120177400//WD domain, G-beta repeatZ/WD domain, G-beta repeatZZWD domain, G-beta repeat 
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THYMU10000830//FAD binding domain 
THYiVIU 1 0001 760//lmmunogiobuiin domain 
THYIVIU1 000291 0//Adaptin N terminal region 
THYMU10003590//PH domaln/ZRhoGAP domain 
5 THYMU10004590//'HMG (liigh mobility group) box 
THYMU10005580//Synaptobrevln 

THYMU20002360//'Pumilio-family RNA binding domains (aka PUI\/I-HD, Pumilio homology domain) 
THYI\/IU20003690//Prokaryotic DNA topoisomerase/ZProtein of unknown function DUF122//Eukaryotic protein kinase 
domain 

10 TRACH1 0000740//lmmunoglobulin domain/Zlmmunogiobulln in domain/Zlmmunoglobulin domain/Zlmmunogiobuiin do- 
main 

TRACH1 0001250ZZimmunoglobulin domainZZImmunoglobulin in domainZZImmunoglobulin domainZZImmunoglobulin do- 
main 

TRACH20000150ZZFatty acid desaturaseZZProtein phosphatase 2C 

15 TRACH2000185QZ/Moiiuscan rhodopsin C-terminai tail 

TRACH20002370ZZKRAB boxZZZinc finger C2H2 typeZZFYVE zinc fingerZZZinc finger, C2H2 typeZZZinc finger, C2H2 
typeZ/Zinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 type 
TRACH20002500ZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeal/ZWD domain, 
G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeatZZWD domain, G-beta repeat 

20 TRACH20002890ZZPH domainZZSrc homology domain 2 

TRAGH20003930ZZRNA recognition motif, (a.k.a. RRM, RBD, or RNP domaln)ZZRNA recognition motif, (a.k.a. RRM, 
RBD, or RNP domain) 
TRACH20004110ZZZinc finger, C2H2 type 

TRACH20004200/ZNeurohypophyslai hormones, G-terminal DomainZZKeratIn, high sulfur 82 protein 
25 TRACH20004720/ZAminotransferases class-liZ/Amlnotransferases class-l 
TRACH20004960ZZAMP-blndlng enzyme 

TRACH20006650ZZLacY protonZsugar symporterZZSugar (and other) transporter 

TRACH20006750ZZE1 Protein, N terminal domainZZATP synthase (EZ31 kDa) subunit TRAGH20009260Z/short chain 
dehydrogenase 

30 TRACH20012890ZZRNA recognition motif, (a.k.a. RRM, RBD, or RNP domain) 
TRACH2001 6070ZZAdenyiate cyclase 
UiVlVEN10001220ZZCorticotropin-releasing factor family 
UMVEN20001330ZZC2 domalnZZC2 domainZZC2 domain 

UTERU1 0001 600ZZSCAN domalnZZZInc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, C2H2 typeZZZinc finger, 
35 C2H2 typeZZZinc finger, G2H2 typeZZZinc finger, C2H2 type 
UTERU1 0001 920ZZIntegrase core domain 

EXAiVIPLE 6 

40 Functional categorization based on the fuii-iength nucleotide sequences 

[0209] The functional prediction and categorization of the proteins encoded by the clones were carried out based 
on the result of homology search of the databases of Gen Bank, Swiss-Prot, UniGeneand nr(see the Homology Search 
Result Data) for the full-length nucleotide sequences and the result of domain search of the amino acid sequences 

45 deduced from the full-length nucleotide sequences (see Example 5), 

[0210] The clone predicted to belong to the category of secretory proteinZmembrane protein means a clone having 
hit data with some annotation, such as growth factor, cytokine, hormone, signal, transmembrane, membrane, extra- 
cellular matrix, receptor, G-protein coupled receptor, Ionic channel, voltage-gated channel, calcium channel, cell ad- 
hesion, collagen, connective tissue, etc., suggesting that It is a secretory or membrane protein, or means a clone In 

50 which the presence of nucleotide sequence encoding a signal sequence or transmembrane domain was suggested 
by the results of PSORT and SOSUI analyses for deduced ORF, 

[0211] The clone predicted to belong to the category of glycoprotein-related protein means a clone having hit data 
with some annotation, such as glycoprotein, suggesting that the clone encodes a glycoproteln-reiated protein, 
[0212] The clone predicted to belong to the category of signal transduction-related protein means a clone having hit 
55 data with some annotation, such asserineZthreonlne-protein kinase, tyroslne-protein kinase, SH3 domain, SH2 domain, 
etc., suggesting that the clone encodes a signal transduction-related protein. 

[0213] The clone predicted to belong to the category of transcription-related protein means a clone having hit data 
with some annotation, such as transcription regulation, zinc finger, homeobox, etc., suggesting that the clone encodes 
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a transcription-related protein. 

[0214] The clone predicted to belong to the category of disease-related protein means a clone having hit data with 
some annotation, such as disease mutation, syndrome, etc., suggesting that the clone encodes a disease-related 
protein, or means a clone whose full-length nucleotide sequence has hit data for Swiss-Prot, GenBank, or UnlGene, 

5 where the hit data corresponds to genes or proteins which have been deposited in the Online IVIendelian Inheritance 
in Man (OMIM) (http://www.ncbi.nlm.nih.gov/Omim/), which is the human gene and disease database. 
[0215] The clone predicted to belong to the category of enzyme and/or metabolism-related protein means a clone 
having hit data with some annotation, such as metabolism, oxidoreductase, E. C. No. (Enzyme commission number), 
etc., suggesting that the clone encodes an enzyme and/or metabolism-related protein. 

10 [0216] The clone predicted to belong to the category of cell division and/or cell proliferation-related protein means 
a clone having hit data with some annotation, such as cell division, cell cycle, mitosis, chromosomal protein, cell growth, 
apoptosis, etc., suggesting that the clone encodes a cell division and/or cell proliferation-related protein. 
[0217] The clone predicted to belong to the category of cytosl<eleton-related protein means a clone having hit data 
with some annotation, such as structural protein, cytoskeleton, actin-binding, microtubles, etc., suggesting that the 

'5 clone encodes a cytoskeleton-related protein. 

[0218] The clone which is predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein 
means a clone having hit data with some annotation, such as nuclear protein, RNA splicing, RNA processing, RNA 
helicase, polyadenylatlon, etc., suggesting that the clone encodes a nuclear protein and/or RNA synthesis-related 

20 [0219] The clone predicted to belong to the category of protein synthesis and/or transport-related protein means a 
clone having hit data with some annotation, such as translation regulation, protein biosynthesis, amino-acid biosyn- 
thesis, ribosomal protein, protein transport, signal recognition particle, etc., suggesting thatthe clone encodes a protein 
synthesis and/or transport-related protein. 

[0220] The clone predicted to belong to the category of cellular defense-related protein means a clone having hit 
25 data with some annotation, such as heat shock, DNA repair, DNA damage, etc., suggesting that the clone encodes a 
cellular defense-related protein. 

[0221] The clone predicted to belong to the category of development and/or differentiation-related proteins means 
a clone having hit data with some annotation, such as developmental protein, etc., suggesting thatthe clone encodes 
a development and/or differentiation-related protein. 
30 [0222] The clone predicted to belong to the category of DNA-blndIng and/or RNA-binding protein means a clone 
having hit data with some annotation, such as DNA-binding, RNA-binding, etc. 

[0223] The clone predicted to belong to the category of ATP-binding and/or GTP-bindIng protein means a clone 
having hit data with some annotation, such as ATP-binding, GTP-binding, etc. 

[0224] In this functional categorization, when a single clone corresponded to multiple categories of those shown 
35 above, the clone was assigned to the multiple categories. However, the function of a protein is not restricted to the 
functional category in this classification , and there is the possibility that other functions are newly assigned to the protein , 
[0225] The clones predicted to belong to the category of secretory protein and/or membrane protein are the following 
439 clones. 

3NB691 00001 80, 3NB691 0000850, 3NB6g20000290, 3NB6920003300, 3NB6920005450, 3NB692001 0020, 

40 ADRGL1 00001 80, ADRGL1 0001 600. ADRGL20003230, BGGI1 2001 0970, BNGH41 0000340, BNGH41 0001 040, 
BNGH41 0001 180, BNGH41 0001 370, BNGH41 0001 980. BRACE1 0000730, BRACE1 0001 690, BRACE20002800, 
BRACE20007180, BRACE2001 0650, BRACE20011170, BRACE20011430. BRACE20013400, BRACE20013520, 
BRACE20014230, BRACE20014530, BRACE20014920, BRACE200 15080, BRACE2001 8590, BRACE20022270, 
BRACE20024680, BRACE20026350, BRACE20026850, BRACE20030780, BRACE20031100, BRACE20034490, 

45 BRAGE20071380, BRACE20071 970, BRACE20072810, BRACE2007401 0, BRACE20074470, BRACE20075020, 
BRACE20075380, BRACE2007641 0, BRACE20076630, BRACE20076850, BRACE200 77610, BRACE20077640, 
BRAGE20077980, BRACE20078680, BRACE20079530, BRACE20084430, BRACE20086550, BRACE20089600, 
BRACE20091880, BRAWH10000010, BRAWH1 0000370, BRAWH1 0000940, BRAWH1 0001 620, BRAWH1 0001 800, 
BRAWH20001090, BRAWH20004430, BRAWH20006970, BRAWH20009840, BRAWH20011290, BRAWH20011410, 

50 BRAWH20011660, BRAWH20014380, BRAWH20014840, BRAWH20015030, BRAWH 20036930, BRAWH20038320, 
BRAWH20040950, BRAWH20052250, BRAWH20059980. BRAWH20087060, BRAWH2009261 0, CD34C2000051 0, 
CTONG20013660, CTONG2001 5330, CTONG200281 60, CTONG20037820, CTONG200471 60, DFNES20003350, 
FCBBF1 00061 80, FGBBF1 0006750, FCBBF2000591 0. FCBBF20007330, FGBBF20008150, FGBBF20009400, 
FCBBF20015380, FEBRA20003780, FEBRA20004040, FEBRA200041 50, FEBRA20004520, FEBRA2000491 0, 

55 FEBRA20006560, FEBRA20006900, FEBRA20007330. FEBRA20008090, FEBRA20008800, FEBRA2001 0930, 
FEBRA20012270, FEBRA20012450, FEBRA20012940, FEBRA20013510, FEBRA20014870, FEBRA2001 4920, 
FEBRA20015840, FEBRA20020860, FEBRA20021910, FEBRA20025250, FEBRA20031550, FEBRA20037070, 
FEBRA20041100, FEBRA20041910, FEBRA20057780, FEBRA20063150, FEBRA20066670, FEBRA20067930, 
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HCASM 10000610, HCASM20002020, HEART20000990, HEART20004920, HHDPC20000950, HLUNG1 0000240, 
HLUNG1 0000370, HLUNG10001100, HLUNG20001160, HLUNG20001250. HLUNG20001420, HLUNG20001760, 
HLUNG20002550, HSYRA20003470, HSYRA20006290, HSYRA20008280, HSYRA20011 030, HSYRA20013320, 
HSYRA20014200, HSYRA20G15800, I MR321 0000440, IMR3210001580, IMR321 0002660, 1MR3220007750, 

5 IMR3220008590, IMR3220009840, IMR3220014350, KIDNE1 0000080, KIDNE1 0001 040, KIDNE1 0001 430, 
KIDNE20000700, KIDNE20000850, KIDNE20001 670, KIDNE200031 50, KIDNE20003300, KIDNE20003490, 
KIDNE20004220, KIDNE200051 70, KIDNE20005190, KIDNE20033050, KIDNE20033570, KIDNE2003941 0, 
KIDNE20042620, KIDNE20042950, KIDNE2004411 0, KIDNE20048280, KIDNE2004981 0, KIDNE20054000, 
KIDNE20054770, KIDNE20060530, KIDNE20060620, KIDNE20063530, KIDNE2O063760, KIDNE20066520, 

10 KIDNE20067600, KIDNE20071860, KIDNE20073520, KIDNE20074220, KIDNE20075690, LIVER1 0000580, 
LIVER1 0000670, LIVER1 0001 040, LIVER10001110, LIVER10001750, LIVER1 0005420, LIVER20004160, 
MAMGL1 0000320, MAMGL1 0001 840, MESAN1 0000350, MESAN1 0001 470, MESAN1 0001 800, MESAN20001490, 
NB9N420000420, NHNPC20002060, NT2NE1 0000230, NT2N El 0000830, NT2NE10001630, NT2NE20003270, 
NT2NE20003920, NT2NE20004550, NT2RE20004700, NT2NE20005500, NT2NE20012470, NT2NE20014350, 

15 NT2NE200i6260, NT2NE20034080, NT2NE20047160, NT2NE20055170, NT2NE20057200, NT2RI20005970, 
NT2RI20009740, NT2RI20010100, NT2RI20014490, NT2RI20015400, NT2RI20015950, NT2RI2001 6570, 
NT2RI2001B660, NT2RI20020220, NT2RI20021520, NT2RI20022430, NT2R 120022520, NT2RI20025300, 
NT2RI20030110, NT2RI2003051 0, NT2RI20031 540, NT2RI2003301 0, NT2RI20033830, NT2RI20036780, 
NT2RI20042840, NT2RI20044420, NT2RI20049850, NT2RI20050870, NT2RI20051500, NT2RI20066820, 

20 NT2RI20068250, NT2RI20070480, NT2RI20070840, NT2RI20073030, NT2RI20074980, NT2RI20077540, 

NT2RI2007B270, NT2RI200B0500, NT2RI200818B0, NT2RI20084810, NT2RI20085980, NT2RI20089420, 
NT2RI20092890, NT2RI20094060, NT2RP60000320, NT2RP60000390, NT2RP60001 090, NT2RP70000690, 
NT2RP700023B0, NT2RP70002590, NT2RP70003640, NT2RP70011660, NT2RP7001591 0, NT2RP70021510, 
NT2RP70023760, NT2RP70023790, NT2RP70026190, NT2RP70029820, NT2RP70040800, NT2RP70043730, 

25 NT2RP70047900, NT2RP70049250, NT2RP70055200, NT2RP70064080, NT2RP70071540, NT2RP70071770, 
NT2RP70073810, NT2RP70074220, NT2RP70075040, NT2RP700761 70, NT2RP70079250, NT2RP70079750, 
NT2RP70081330, NT2RP70081370, NT2RP70083150, NT2RP70085500, NT2RP70090120, NT2RP70091490, 
NT2RP70091680, NT2RP70092360, NT2RP70093220, NT2RP70093730: NT2RP70094290, NT2RP7009481 0, 
NT2RP70094980, NT2RP70095070, NT0NG1 0000980, NTONG10002140, NT0NG1 0002570, NTONG20002650, 

30 NTONG20004920, NTONG20008000, NTONG20012220, 0CBBF1 0000420, OCBBF20002310, OCBBF20009980, 
OCBBF20012100, PANCR1 0000210, PLACE50000670, PLACE50000680, PLACE50001 050, PLACE50001130, 
PLACE600i2810, PLACE6001 8860, PLACE60020160, PLACE60020840, PLACE60026990, PLACE60037050, 
PLACE60037450, PLACE60043960, PLACE60044540, PLACE60047380, PLACE60049930, PLACE60050290, 
PR0ST1 0002200, PROST1 0002720, PR0ST1 0005260, PR0ST1 0005360, PROST20000360, PROST20026820, 

35 PROST20029600, PROST20 032320, PROST20033020, PROST20039220, PROST20044160, PROST20051430, 
PROST20054260, PROST20058800, PROST200591 90, PROST20059430, PROST20069880, PROST20072370, 
PROST20073890, PUAEN1 0000570, PUAEN1 0003220, SALGL1 0001 570, SKMUS20007740, SKNMC10000190, 
SKNMC10000290, SKNMCI 0002290, SKNMC1 000251 0, SKNMC20011130, SKNMC20015030, SMINT1 00001 60, 
SMINT1 0000420, SMINT1 0000570, SMINT1 00011 80, SMINT200001 80, SMINT20002770, SPLEN1 000091 0, 

40 SPLEN20001340, SPLEN20002430, SPLEN20002700, SPLEN200031 00, SPLEN20004960, ST0MA1 0000520, 

ST0MA1 0001 170, STOMA20000320, STOMA20002570, SYNOV20001 770, SYNOV2001 6480, TESTI1 0000420, 
TEST1 10000960, TESTI1 0001 270, TESTI1 0001380, TESTI20001 770, TESTI20006000, TESTI20007620, 
TESTI20008830, TESTI20009090, TESTI20009700, TESTI20011340, TESTI20012370, TESTI2001 3520, 
TESTI20014200, TESTI20016210, TESTI20016710, TESTI20018520, TESTI20018620, TESTI20020020, 

45 TESTI20020810, TESTI2002251 0, TESTI20024230, TESTI20024650, TESTI20024670, TESTI20025800, 
TESTI20026320, TESTI20026980, TESTI20027000, TESTI20027070, TESTI20028660, TESTI20030370, 
TESTI20031930, TESTI200341 90, TESTI20036490, TESTI20039980, TESTI20042870, TESTI20047120, 
TESTI20049940, TESTI20056900, TESTI20057420, TESTI20058600, TESTI20067740, TESTI20069780, 
TESTI20074800, TESTI20077490, TESTI20079510, TESTI20080200, TESTI20081440, TESTI20087740, 

50 TESTI20088470, TESTI20136910, THYMU1 0000830, THYMU1 0001 760, THYMU1 0003290, THYMU1 0003820, 
THYMU 10005580, TRACH1 0000630, TRACH1 0001 000, TRACH1 0001 400, TRACH20001 850, TRACH20001 960, 
TRACH20004200, TRACH20004960, TRACH20006650, TRACH20007670, TRACH20008980, TRACH2001 5920, 
UMVEN20001330, UTERU1 0000770, UTERU1 0000960, UTERU1 0001 920, UTERU20000470, UTERU20003930, 
UTERU20004850 

55 [0226] The clones predicted to belong to the category of glycoprotein-related protein are the following 87 clones. 
BNGH41 0000340, BNGH410001180, BRACE20014920, BRACE200 15080, BRACE20018590, BRACE20024680, 
BRACE20026350, BRACE20031100, BRACE20074470, BRAWH 10000370, BRAWH20001 090, BRAWH20011660, 
BRAWH20014840, BRAWH20059980, CD34C2000051 0, CTONG20013660, CTONG20028160, CTONG20037820, 
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FCBBF20007330, FEBRA20007330, FEBRA20008800, FEBRA20014920, FEBRA20015840, FEBRA20057780, 
HEART20005060, HLUNG10001100, HLUNG20002550, HSYRA20013320, I MR32 10002660, IMR3220O07750, 
IMR3220013320, KIDNE2004411 0, KIDNE20063760, KIDNE20067600, KIDNE20073520, LIVER20000370, 
MESAN1000035G, NT2NE1 0000830, NT2NE10001B50, NT2NE20003270, NT2NE2001 6260, NT2RI2001 8660, 

5 NT2RI20025300, NT2RI20036780, NT2RI2Q077540, NT2RI20080500, NT2RI20085980, NT2RI20089420, 

NT2RI20092890, NT2RP70000690, NT2RP70004770, NT2RP70055200, NT2RP70081370, NT2RP700831 50, 
NT2RP70091490, NT2RP70092360, NT2RP70094980, NTONG1 00021 40, OCBBF2000231 0, OCBBF20002770, 
PLACE50000680, PLACE50001130, PLACE6001 8860, PLACE60044540, PROST2001 8230, PROST20032320, 
PROST20073890, SALGL1 0001 570, SKNMC20015030, SMINT1 00001 60, SMINT20002770, SPLEN20001 340, 

10 TESTI1 0001 270, TESTI1 0001 380, TESTI20001770, TESTI20024230, TESTI20027070, TESTI20036490, 
TESTI20039980, TESTI20056900, TESTI20057420, TESTI2007951 0, THYMU1 0001 760, TRACH1 0000740, 
TRACH1 0001 250, TRACH20004200, UTERU20000470 

[0227] The clones predicted to belong to the category of signal transduction-related protein are the following 46 
clones. 

15 ADRGL20000740, ASTR01 00001 80, BRACE20005770, BRACE20022020, BRACE20027360, BRACE20027920, 
BRAWH20006860, CTONG20005890, FEBRA20000350, HHDPC20000550, IMR3220003020, KIDNE20033730, 
KIDNE2G040840, KIDNE20053360, KIDNE20062990, NT2R 120033440, NT2RI2005811 0, NT2RI200621 00, 
NT2RI20073840, NT2RP70006240, NT2RP70043960, NT2RP70046870, NT2RP70061 880, NT2RP70072520, 
NT2RP70081440, NT2RP70093700, NT0NG1 0001 820, PEBLM20004790, PLACE60026680, PROST20033400, 

20 PROST20043320, SKMUS1 0000220, SKMUS20016680, SPLEN20003570, TESTI20001 540, TESTI2000591 0, 
TESTI20022560, TESTI20024980, TESTI20029120, TESTI20034980, TESTI20049820, TESTI20055840, 
THYMU 10003590, THYIVIU20003690, TRACH20002500, TRACH20002890 

[0228] The clones predicted to belong to the category of transcription-related protein are the following 140 clones. 
3NB6920010220, 3NB692001511 0, 3NB6920015570, ADRGL10000650, BGGI120006840, BGGI1 20006930, 
25 BGGI120017140, BNGH41 0000800, BNGH420005320, BRACE 10000930, BRACE20014550, BRACE20018550, 
BRACE20020910, BRACE20024090, BRACE20071740, BRAWH1 0000020, BRAWH1 0001 640, BRAWH1 0001 680, 
BRAWH20006330, BRAWH2000901 0, 

CTONG20025580, CTONG20028200, FCBBF1 0005980, FCBBF20000940, FCBBF2000951 0, FCBBF5000261 0, 
FEBRA20003970, FEBRA20003990, FEBRA20Q04540, FEBRA20009720, FEBRA20011460, FEBRA2001 71 50, 
30 FEBRA20050140, FEBRA20064760, FEBRA20067360, FEBRA20069420, FEBRA20072800, HLUNG1 0000760, 
HLUNG20000680, HSYRA1 0001370, 

HSYRA20016310, IIVIR32 10002420, IMR3220007420, KIDNE20000510, KIDNE20039940, KIDNE20061490, 
KIDNE20078110, NES0P1 0000870, NHNPC10001240, NHNPC20002120, NT2NE20002590, NT2NE20008090, 
NT2RI20003410, NT2RI20004120, NT2RI20004210, NT2RI20010830, NT2RI20018460, NT2RI20025410, 

35 NT2RI20025850, NT2RI20060710, 

NT2RI20067350, NT2RI20071330, NT2RI20074390, NT2RI20078790, NT2RI20087140, NT2RI20090650, 
NT2RI20092150, NT2RP60001000, NT2RP60001270, NT2RP70002710, NT2RP70008120, NT2RP7001 8560, 
NT2RP70024500, NT2RP70032030, NT2RP70036290, NT2RP70042040, NT2RP7004541 0, NT2RP70046560, 
NT2RP70055130, NT2RP70061 620, 

40 NT2RP70062960, NT2RP70064900, NT2RP70069860, NT2RP70075370, NT2RP70085570, NT2RP70087200, 
NT2RP70090190, NTONG20003340, NTONG20003630, NTONG20015500, OGBBF20011010, OCBBF20011240, 
OCBBF20015860, PEBLIV120002480, PEBLM20002700, PEBLM20003080, PEBLI\/120003950, PLACE60002050, 
PLAGE60005550, PLACE60021510, 

PLACE600303B0, PROST20018230, PROST20031170, PROST20073170, PUAEN1 0001 610, SALGL1 0000650, 
45 SKMUS1 0000640, SKMUS20014920, SKNI\/IC20000650, SKNMC20002240, SKNMC20003560, SIVIINTI 0001 000, 
SIVIINT20005450, SPLEN20000200, SPLEN20000720, SYNOV20010140, SYNOV20013740, SYNOV20014510, 
TEST1 10000550, TESTI20001200, 

TESTI20007070, TESTI20010490, TESTI20015560, TESTI20018150, TESTI20018790, TESTI20021490, 
TESTI20026760, TESTI20027890, TESTI2003071 TESTI20034130, TESTI20042290, TESTI20053960, 
50 TESTI20074640, TESTI20074660, TESTI20078640, THYMU1 0004590, TRACH20000790, TRACH20002370, 
TRAGH20009440, UTERU1 0001 600 

[0229] The clones predicted to belong to the category of disease-related protein are the following 21 9 clones. Further, 
hit data of all the clones for Swiss-Prot, or GenBank, UniGene, or nr corresponded to genes or proteins which had 
been deposited in the Online Mendelian Inheritance in Man (OMIM), which is the human gene and disease database, 
55 (the OIVIIM Number is shown in the parenthesis after the Clone Name). 

ADRGL1 0000020 (605332), ADRGL1 0001 600 (201910), ADRGL20000740 (300118), ASTR0200041 70 (605937), 
BGGI120006840 (604480), BGGI120010970 (602346), BGGI120017140 (194631), BNGH41 0001 770 (146690), 
BNGH420005320 (601260), BRACE1 0001 870 (157132), BRACE20006980 (106410), BRACE20007180 (114160), 



90 



EP 1 293 569 A2 



BRACE20014550 (140580), BRACE20018550 (109560), BRACE20018590 (602644), BRACE20027550 (179715), 
BRACE20027720 (138760), BRACE20076850 (605209), BRACE20086550 (603540), BRAWH10000020 (605678), 
BRAWH1 0001 640 (606043), BRAWH20001770 (138450), BRAWH20005030 (179715), BRAWH20005220 (603747), 
BRAWH20006330 (194500), BRAWH20006860 (602958), BRAWH20009840 (601258), BRAWH20011660 (230500; 
5 230600;230650;253010), CD34C2000051 0 (600031), CTONG20005890 (603583), CTONG20019110 (603486), 
CTONG20024180 (602895), CTONG20025580 (601856), CTONG20037820 (602729), CTONG20055530 (106410), 
FCBBF20000940 (601408), FCBBF2000951 0 (194531). FCBBF40002820 (130410), FEBRA20001 050 (600025), 
FEBRA20003990 (601781), 

FEBRA20004150 (126650;214700), FEBRA20004540 (194558), FEBRA20009720 (602277), FEBRA2001 0930 
10 (603878), FEBRA20011460 (603900), FEBRA20050790 (176879), FEBRA20057880 (604362), FEBRA20064760 
(602277), FEBRA20067930 (602921), FEBRA200701 70 (606098), FEBRA20075510 (179513), FEBRA20075660 
(179715), HCASM20002140 (123834), HEART20004480 (191045;115195), HLUNG10001050 (310400), 
HLUNG20000680 (300024), HSYRA10001370 (602277), HSYRA20006400 (601278), HSYRA20013320 (146732), 
HSYRA20016310 (604080), 

15 IMR3210000440 (601890), IMR322000791 0 (313440), KIDNE10001040 (603217), KIDNE20003150 (602417), 
KIDNE20033730 (605216), KIDNE20042950 (120160), KIDNE20044110 (605239), KIDNE20050420 (214500), 
KIDNE20059080 (604276), KIDNE20063760 (231950), KIDNE2007811 0 (603430), LIVER1 0002300 (161015), 
LIVER10004330 (603197), LIVER20000330 (191161), LIVER20000370 (138670), MAMGL1 0001 780 (603403), 
MESAN10001800 (606048), MESAN2000291 0 (142810), M ESAN2000501 0 (602769), NB9N41 0001 350 (179508), 

20 NHNPC10000840 (604819), NHNPC200021 20 (194558), NT2NE1 0000730 (601905), NT2NE20002990 (147625), 
NT2NE20003690 (232000), NT2NE200051 70 (603330), NT2NE20005360 (150370), NT2NE20006580 (605969), 
NT2NE20008090 (603899), NT2NE20013720 (180480), NT2NE20016340 (602184), NT2NE200551 70 (128100), 
NT2RI20004120 (600140), NT2RI20004210 (314997), NT2RI20010910 (601940), NT2RI2001 4500 (190370), 
NT2RI20020410 (168730; 180990), NT2RI200295B0 (605689), NT2RI20031540 (300061), NT2RI20033440 (601014), 

25 NT2RI20041900 (179715), NT2RI20056470 (123940), NT2RI20057230 (601940), NT2RI20067030 (603577), 
NT2RI20070960 (311030), NT2RI20074980 (603105), NT2RI20077540 (300112), NT2RI20080500 (142461), 
NT2RI20083960 (605612), NT2RI20084810 (603099), NT2RI20092150 (600834), NT2RI20092890 (603104), 
NT2RP60000350 (605612), NT2RP60001000 (314995), NT2RP60001230 (600025), NT2RP70000690 (158340; 
113720), NT2RP70004250 (160776), NT2RP7002B750 (179838), NT2RP70029060 (140571), NT2RP70032030 

30 (602277), 

NT2RP70036290 (600005;209920) NT2RP70042600 (160776), NT2RP70046560 (602410), NT2RP70049250 
(601703), NT2RP70055020 (604581), NT2RP70062960 (133540), NT2RP70063040 (604061), NT2RP70065270 
(300111), NT2RP70069B60 (602277), NT2RP70071770 (603046), NT2RP70073810 (601439), NT2RP70074220 
(313440), NT2RP70075370 (109092), NT2RP70079250 (602346), NT2RP70081440 (601335), NT2RP70090120 
35 (602727), NT2RP70090190 (194558), NT2RP70093220 (300008;300009;310468) , NT2RP70094980 (135820), 
NTONG 10002460 (600856), 

NTONG20003630 (600140), NTONG20015500 (604077), OCBBF10001180 (191161), OCBBF20008240 (187790), 
PEBLM1 0000340 (133450), PEBLM20002480 (300024), PEBLM20003080 (604077), PEBLM20003950 (600834), 
PLACE50000800 (601797), PLACE60002050 (600013), PLACE60003790 (603403), PLACE60014430 (603707), 
40 PR0ST1 0001 670 (313440), PR0ST1 0005360 (602346), PROST20002730 (601 985;1 88550), PROST20032320 
(253220), PROST20033400 (300203), PROST20062600 (601940), PROST20072890 (191161), PROST20073890 
(192240), 

PROST20085160 (191030;164970), SALGL10001570 (603743), SKMUS10000140 (191340), SKMUS10001180 
(601402), SKMUS1 0001 290 (604055), SKMUS20000740 (605196), SKMUS20003900 (604850), SKMUS20007240 
45 (604300), SKMUS20016340 (163906), SKNMC1 0002510 (605452), SKNMC20000650 (604078), SKNMC20003220 
(117140), SMINT10000420 (601615), SMINT1 0000570 (604814), SMINT1 0001 000 (603851), SMINTI 0001 030 
(605759), SMINT20004000 (601278), SPLEN10001430 (163905), SPLEN20001 970 (601940), STOMA20000880 
(147220), 

STOMA20003960 (300111), SYNOV20013740 (604076), SYNOV20014510 (600661), SYNOV2001 6480 (131222; 
50 603041), TESTI10001270 (601313;173900), TESTI10001310 (186982), TESTI20001200 (19451 0), TESTI20001 770 
(146650), TESTI20002530 (605440), TESTI20006000 (179838), TESTI20006990 (602591), TESTI20007620 
(1 26650;21 4700), TESTI20008830 (1 60794), TESTI2001 1 BOO (1 90370), TESTI2001 2690. (1 09720), TESTI2001 51 20 
(604700), TESTI20018520 (602346), TESTI2001 8790 (300024), TESTI20021 490 (604073), TESTI200251 60 
(300097), 

55 TESTI20027070 (173335), TESTI20027290 (300127), TESTI200291 20 (600855), TESTI20033250 (168730), 
TESTI20049820 (176894), TESTI20053960 (604074), TESTI20068660 (603395), TESTI20071 830 (605769), 
TESTI20074640 (603899), TESTI20079510 (116930), TESTI20086570 (300153), TESTI20140360 (170100), 
THYMU10000830 (600857), THYMU10001760 (116930), THYMU10003590 (602857), THYMU10004910 (604908), 
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TRACH20002370 (602277), UTERU 10000960 (603931), UTERU20000470 (602070) 

[0230] The clones predicted to belong to the category of enzyme and/or metabolism-related protein are the following 
168 clones. 

3NB6920002810, ADRGL10001600, ADRGL1 0001 650. BGGI120005330, BNGH41 0000340, BNGH41 0001 770, 

5 BRACE 10000420, BRACE20015080, BRACE20022020, BRACE20024680, BRACE20026850, BRACE20027360, 
BRACE20027720, BRACE20027920, BRACE20071 380, BRACE20084430, BRAWH20001 770, BRAWH2000651 0, 
BRAWH20006860, BRAWH20009840, BRAWH20011 660, BRAWH20014180, BRAWH20014840, BRAWH20036890, 
BRAWH20059980, BRAWH20069890, BRAWH20089560, CTONG2001 3660, CTONG20019110, DFNES200021 20, 
FCBBF20007330, FCBBF200 15380, FEBRA20000350, FEBRA20001290, FEBRA2000311 0, FEBRA20024420, 

10 FEBRA20041100, FEBRA20045920, FEBRA20050790, FEBRA200521 60, FEBRA20062700, FEBRA200631 50, 
HEART20000350, HHDPC20000550, HHDPC20004550, HLUNG1 0001 050, HLUNG20002550, HSYRA1 0001 680, 
HSYRA20005100, HSYRA20015740, IMR3220008380, IMR3220009190, IMR3220012180, IMR3220013170, 
KIDNE20000410, KIDNE20003490, KIDNE20004220, KIDNE20005130, KIDNE20033050, KIDNE20040840, 
KIDNE20046810, KIDNE20056290, KIDNE20060530, KIDNE20063760, KIDNE20068800, KIDNE20073280, 

15 KIDNE20073520, KIDNE200781 00, LIVER1 0000670, LIVER1 0002300, MAMGL1 0001 780, MESAN2000291 0, 

MESAN20005010, NT2NE1 0000730, NT2NE1 0001 850, NT2NE20002140, NT2NE20003270, NT2NE20003690, 
NT2NE20005860, NT2NE2001 3720, NT2NE2001 6340, NT2NE2001 6660, NT2RI1 0000480, NT2RI2001 01 00, 
NT2RI20015400, NT2RI20020220, NT2RI20025300, NT2RI2003301 0, NT2RI20036780, NT2RI2003751 0, 
NT2RI20051500, NT2RI20068550, NT2RI20073840, NT2RI20074980, NT2RI2008481 0, NT2RI2008791 0, 

20 NT2RP70004770, NT2RP70006240, NT2RP70011 660, NT2RP700261 90, NT2RP70062960, NT2RP70072520, 
NT2RP70076100, NT2RP70081440, NT2RP70084060, NT2RP70085570, NT2RP70093700, NT0NG1 0001 820, 
OCBBF20008240, OCBBF20012100, OCBBF20014080, OCBBF20014940, PANCR1 0000210, PEBLM20004790, 
PLACE50001050, PLACE50001130, PLACE60003790, PLACE6001281 0, PLACE6001 8860, PLACE60044540, 
PROST20031170, PROST20032320, PROST20033400, PROST20051210, PROST20064500, SKMUS1 0001 290, 

25 SKMUS1 0001 770, SKMUS20000740, SKMUS20007240, SKI\/IUS20008630, SKMUS20009330, SKMUS20011290, 
SKNSH1 0001 740, SKNSH20003470, SMINT1 00001 60, SPLEN20001340, ST0MA1 0001 860, STOMA20001210, 
STOMA20004820, SYNOV20016480, TEST1 10000700, TEST1 1000 1380, TESTI 2000 1540, TESTI2000591 0, 
TESTI20012690, TESTI2001 8270, TESTI20022560, TESTI20027070, TESTI20029120, TESTI200341 90, 
TESTI20034980, TESTI20040000, TESTI20042070, TESTI20042950, TESTI20047120, TESTI20049820, 

30 TESTI201 38320, TESTI201 40360, TESTI30000020, THYMU 10000830, THYMU1 000491 0, THYMU200031 70, 
THYMU20003690, TRACH20000150, TRACH20004720, TRACH20004970, TRACH20009260, UTERU10000960 
[0231 ] The clones predicted to belong to the category of cell division and/or cell proliferation-related protein are the 
following 23 clones. 

BGGI120001610, BRACE20027550, BRACE20076850, BRAWH20005030, BRAWH20005220, FEBRA20075660, 
35 HCASM20002140, HLUNG1 0000640, IMR3220009730, NT2NE20003840, NT2RI20006850, NT2RI20041900, 
NT2RI2005B110, NT0NG1 0002460, NTONG20008780, SKMUS2001 6340, SKNMC20003220, SPLEN1 0001 430, 
TEST1 10001 680, TESTI20001840, 
TESTI20021050, TESTI200351 20, TESTI20057310 

[0232] The clones predicted to belong to the category of cytosl<eleton-related protein are the following 60 clones. 

40 ADRGL1 0000020, BRACE20006980, BRACE20008850, BRACE20027960, BRACE20074470, BRACE20076630, 
BRACE20078820, BRACE20093070, BRAWH20000480, BRAWH20066220, CTONG2001 9550, CTONG200281 60, 
CTONG20055530, DFNES20002680, FCBBF20005910, FEBRA20007720, FEBRA2000881 0, FEBRA20034290, 
FEBRA20043290, FEBRA20072000, HEART20004480, HEART20005200, HLUNG1 0001100, HSYRA20006050, 
IMR3220007910, KIDNE20040840, KIDNE20052960, NT2RI20014090, NT2RI20032220, NT2RI20058510, 

45 NT2RI20090660, NT2RP70000690, NT2RP70004250, NT2RP70028750, NT2RP70042600, NT2RP70049250, 
NT2RP70074220, NTONG20009660, OCBBF20011760, OCBBF20015280, PEBLM1 0000680, PR0ST1 0001 670, 
PROST20033380, TEST1 1 0000420, TEST1 10000510, TESTI20003560, TESTI20004350, TESTI20006000, 
TESTI20006990, TESTI20008490, TESTI20008830, TESTI20011410, TESTI20015110, TESTI20016610, 
TESTI20020570, TESTI20024230, TESTI20031090, TESTI20031170, TESTI20039140, TESTI20078720 

50 [0233] The clones predicted to belong to the category of nuclear protein and/or RNA synthesis-related protein are 
the following 59 clones. 

3NB6920002810, 3NB6920015280, BGGI120005440, BRACE1 00011 50, BRACE20024780, BRACE20027550, 
BRAWH20005030, BRAWH20014180, BRAWH20069890, CTONG200241 80, FEBRA20001290, FEBRA20075660, 
HEART20003090, HLUNG1 0000640, HSYRA1 0001 680, HSYRA200051 00, IIV1R3220008630, IMR322001 21 80, 
55 MAMGL1 0001 780, NT2NE1 0001 850, NT2NE200021 40, NT2NE20003B40, NT2NE2001 6660, NT2NE2005441 0, 
NT2RI20002820, NT2RI20006850, NT2RI20010910, NT2RI20025540, NT2RI20041 900, NT2RI20053350, 
NT2RI20057230, NT2RI2O060720, NT2RI20067030, NT2RI20068550, NT2RI20078840, NT2RI20087490, 
NT2RP70004770, NT2RP70013060, NT2RP70076430, NTONG20008780, PEBLM1 0000340, PLACE50000580, 
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PLACE60003790, PROST20001760, PROST20062600, SKMUS1 0000220, SKMUS20016340, SKNMC20003220, 
SPLEN1 0001 430, SPLEN20001970, TESTI10001680, TESTI20002530, TESTI20007840, TESTI20021050, 
TESTI20029120, TESTI20035120. TESTI20057310, TRACH20003930, TRACH20012890 

[0234] The clones predicted to belong to the category of protein synthesis and/or transport-related protein are the 
5 following 24 clones. 

BRACE20078680, FEBRA2007551 0, IMR3220008380, KIDNE200051 90, KIDNE20050420, IVIESAN2000291 0, 
NB9N410001350, NT2NE20005360, NT2RI20032Q50, NT2RI20032220, NT2RP70000760, NT2RP70076430, 
NT2RP70093940, OCBBF20008240, PLACE50000580, PROST20000530, SKMUS20000740, SKMUS20008630, 
TESTI20007840, TESTI20Q1 51 20, TESTI2001 8690, TESTI2007B720, THYMU1 0005580, UMVEN20001 330 
10 [0235] The clones predicted to belong to the category of cellular defense-related protein are the following 6 clones. 
BRACE20014550, NT2RI20037510, NT2RI20053350, NT2RP70029060, NT2RP70062960, PLACE50001700 
[0236] The clones predicted to belong to the category of development and/or differentiation-related protein are the 
following 19 clones. 

BGG1 120006930, CTONG20028200, FCBBF50002610, FEBRA20014920, FEBRA20017150, FEBRA20060920, 
15 IViAIViGLI 0001 820, NESO PI 0000870, NHNPC1 0001 240, NT2R 120078790, NT2RP70008120, NT2RP7001 8560, 
NT2RP70045410, OCBBF20002770, SALGL1 0000650, SMINTI 0001 000, TESTi 10000550, TESTI20026760, 

TESTI2007ai40 

[0237] The clones predicted to belong to the category of DNA-binding and/or RNA-binding protein are the following 
158 clones. 

20 3NB6920002810, 3NB6920010220, 3NB6920015110, 3NB6920015570, ADRGL1 0000650, BGGII 20006840, 
BGG1 120006930, BNGH41 0000800, BNGH420005320, BRACE200 14550, BRACE2002091 0, BRACE20024090, 
BRACE20024780, BRACE20071 740, BRAWH1 0001 640, BRAWH1 0001 680, BRAWH20000340, BRAWH20006330, 
BRAWH20009010. BRAWH20014180, 

BRAWH20069890, CTONG20025580, CTONG20028200, D3OST20001840, FCBBF1 0005980, FCBBF20009510, 
25 FCBBF50002610, FEBRA20003970, FEBRA20003990, FEBRA20004540, FEBRA20008560, FEBRA20009720, 
FEBRA20017150, FEBRA20017900, FEBRA20050140, FEBRA20064760, FEBRA20067360, FEBRA20069420, 
FEBRA20072800, HEART20003090, 

HLUNG10000760, HSYRA1 0001 370, HSYRA2001 631 0, IIVIR321 0002420, IMR3220007420, IMR3220008630, 
KIDNE2G0G0510, KIDNE2a039940, KIDNE20061 490, KiDNE20078110, NES0P1 0000870, NHNPC1 0000840, 
30 NHNPC10001240, NHNPC20002120, NT2NE20002590, NT2NE20003840, NT2NE20008090, NT2NE2001 6660, 
NT2NE20054410, NT2RI20003410, 

NT2RI20004210, NT2RI20006850, NT2RI2001 0830, NT2RI20010910, NT2RI2002541 0, NT2RI20025850, 
NT2RI20057230, NT2RI20060710, NT2RI20067350, NT2RI20071330, NT2RI20074390, NT2RI20078790, 
NT2RI2007B840, NT2RI20087140, NT2RI20087490, NT2RI20090650, NT2RP60001000, NT2RP60001270, 

35 NT2RP70002710, NT2RP70008120, 

NT2RP70013060, NT2RP70018560, NT2RP70024500, NT2RP70032030, NT2RP70042040, NT2RP7004541 0, 
NT2RP70046560, NT2RP70055130, NT2RP70061620, NT2RP70062960, NT2RP70064900, NT2RP70069860, 
NT2RP70075370, NT2RP70081 670, NT2RP70085570, NT2RP70087200, NT2RP700901 90, NTONG20003340, 
NTONG20008780, NTONG2001 5500, 

40 OCBBF20011010, OCBBF20015860, PEBLM1 0000340, PEBLM20001120, PEBLM20002700, PEBLIVI20003080, 
PLAGE60002050, PLACE60005550, PLACE60021510, PLAGE60030380, PROST20001760, PROST20003250, 
PROST20018230, PROST20031170, PROST20062600, PROST20073170, SALGL1 0000650, SKIVIUSI 0000640, 
SKMUS20014920, SKMUS20016340, 

SKNMC20000650, SKNMG20002240, SKNI\/IC20003220, SKNI\/IC20003560, SMINTI 0001 000, SI\/IINT20005450, 
45 SPLEN1 0001 430, SPLEN20000200, SPLEN20000720, SPLEN20001 970, SYNOV20010140, SYNOV20013740, 
SYNOV20014510, TEST1 10000550, TESTI20001200, TESTI20007070, TESTI20010490, TESTI20013450, 
TESTI2001 5560, TESTI2001 81 50, 

TESTI20021050, TESTI20021490, TESTI20026760, TESTI20027890, TESTI2003071 0, TESTI20033270, 
TESTI20034130, TESTI20035120, TESTI20053960, TESTI20074640, TESTI20074660, TESTI20078640, 
50 THYMU 10004590, TRACH20000790, TRACH20002370, TRACH20009440, TRACH20012890, UTERU1 0001 600 
[0238] The clones predicted to belong to the category of ATP binding and/or GTP-binding protein are the following 
63 clones. 

3NB6920002810, BNGH41 0000390, BRACE20022020, BRAGE20028120, BRAGE20071 380, BRAWH20000480, 
BRAWH20006860, BRAWH20066220, GTONG2001 3200, DFN ES20002680, FEBRA20043290, FEBRA200521 60, 
55 FEBRA20072000, FEBRA2007551 0, HHDPC20000550, HLUNG200011 60, HSYRA1 0001 680, HSYRA200051 00, 
HSYRA20006050, KIDNE20040840, MAMGL1 0001 780, MESAN2000291 0, NB9N41 0001 350, NT2NE20003690, 
NT2NE20005170, NT2NE2001 6660, NT2NE20055170, NT2RI20068550, NT2RI20073840, NT2RP70004250, 
NT2RP70011660, NT2RP70029060, NT2RP70036290, NT2RP70042600, NT2RP70046870, NT2RP70062960, 
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NT2RP70081370, NT2RP70081440, NT2RP70093700, OCBBF20008240, OCBBF20015280, PEBLM20004790, 
PLACE50001700, PLACE60003790, PROST2001 8990, PROST20033400, SKMUS20008630, SMINT1 0000420, 
TESTI20001540, TESTI20003560, TESTI20005910, TESTI20006950, TESTI20006990, TESTI20008490, 
TESTI20015110, TESTI200 16610, TESTI20022560, TESTI20029120, TESTI20034980, TESTI20042290, 
5 TESTI20047120, TESTI20049820, TESTI20057310 

[0239] Among the clones other than the ones shown above, NTONG1 0001 300 is a clone which was predicted to 
highly possibly belong to the category of secretory protein and/or membrane protein based on the result of domain 
search by Pfam. 

FEBRA20017060, NT2RI20066790, SMINT10000710 
10 [0240] The three clones shown above are clones which were predicted to highly possibly belong to the category of 
glycoprotein-related protein based on the result of domain search by Pfam. 

BRACE20080970, BRACE20092120, BRAWH1 0001300, FEBRA20019890, KIDNE20031850, KIDNE20060140, 
l\/IESAN20000920, NB9N41 0000470, NT2RI20071480, NT2RI2007B910, NT2RP700B8550, NTONG20016120, 
0CBBF1 0000910, PROST20094830, SKNSH1 000301 0, SPLEN20002670, TESTI20031 960, TESTI20036250, 
15 TESTI2003781 0, TESTI20G83870, TESTI201 77400 

[0241] The 21 clones shown above are clones which were predicted to highly possibly belong to the category of 
signal transduction-related protein based on the result of domain search by Pfam. 

3NB6920009120, 3NB692001 471 0^ BRACE1 0001 660, BRACE20083850, BRAWH20004760^ BRAWH20012030, 
CTONG20011390, CTONG2001 8200, FEBRA20007870, FEBRA20043250, HHDPC20003150^ NT2RI1 0000270, 
20 NT2RI20036950, NT2RI20053680, NT2RI20072540, NT2RI20083360, NT2RP70030550, OCBBF2001 3070, 
OCBBF20015270, PLACE60046630, PR0ST1 0003430, PROST20067370, SKMUS1 0001 040, SKNMC2001 5960, 
TESTI20030050, TESTI20033540, TESTI20035890, TESTI20068720, TRACH2000411 0 

[0242] The 29 clones shown above are clones which were predicted to highly possibly belong to the category of 
transcription-related protein based on the result of domain search by Pfam. 

25 BNGH41 0001 900, BRACE20080970, BRACE20092120, BRAWH20093600, FEBRA20003770, FEBRA20024290, 
HLUNG1 0000990, KIDNE20004030, MESAN20000920, NB9N420001040, NT2NE1 00001 40, NT2NE20001740, 
NT2RI20050610, NT2RI20055640, NT2RI20072540, NT2R 120074690, NT2RP60000860, NT2RP70036470, 
NT2RP70036800, NT2RP7007221 0, NT2RP70074060, NT2RP70084870, NT0NG1 0001 300, NT0NG1 0002640, 
NTONG20016120, 0CBBF1 0000910, OCBBF10001190, OCBBF200071 90, SKMUS200D1170, SKMUS2001 6620, 

30 SKNMC20000970, SKNMC20015960, SYN0V1 0001 280, TESTI20002380, TESTI20006270, TESTI2001 3300, 
TESTI20031520, TESTI20036250, TESTI2003781 0, TESTI20064830, TESTI20083870, TRACH20006750, 
TRACH20016070 

[0243] The 43 clones shown above are clones which were predicted to highly possibly belong to the category of 
enzyme and/or metabolism-related protein based on the result of domain search by Pfam. 
35 NT2RI20064120 

[0244] The clone shown above is a clone which were predicted to highly possibly belong to the category of cell 
division and/or cell proliferation-related protein based on the result of domain search by Pfam. 
BRACE20083800, KIDNE20004970 

[0245] The two clones shown above are clones which were predicted to highly possibly belong to the category of 

40 cytoskeleton-related protein based on the result of domain search by Pfam. 

3NB6920009120, 3NB692001 471 0, BRACE1 0001 660, BRACE20083850, BRAWH20004760, BRAWH20012030, 
BRAWH20064500, CTONG20011390, CTONG2001 8200, FEBRA20007B70, FEBRA20043250, HCASIVI20003070, 
HHDPC20003150, NT2RI1 0000270, NT2RI20036950, NT2RI20053680, NT2RI20072540, NT2RI20083360, 
NT2RP70012310, NT2RP70030550, NT2RP70036470, OCBBF20013070, OCBBF20015270, PLACE60046630, 

45 PR0ST1 0003430, PROST20067370, SKMUS1 0001 040, SKNMC20000970, SKNI\/IC20015960, TESTI20030050, 
TESTI20032280, TESTI20033540, TESTI20035890, TESTI20068720, TRACH2000411 0 

[0246] The 34 clones shown above are clones which were predicted to highly possibly belong to the category of 

DNA-binding and/or RNA-binding protein based on the result of domain search by Pfam. 

NT2RI20064120 

50 [0247] The clone shown above is a clone which was predicted to highly possibly belong to the category of ATR- 
binding and/or GTP-binding protein based on the result of domain search by Pfam. 

[0248] The 178 clones shown below are clones which were unassignable to any of the above-mentioned categories, 
but have been predicted to have some functions based on homology search for their full-length nucleotide sequences 
and motif search in their deduced ORFs. Clone Name, Definition in the result of homology search or Motif Name in 
55 the motif search, demarcated by a double slash mark (//), are shown below. 

3NB6910001160//STEROIDOGENIC ACUTE REGULATORY PROTEIN PRECURSOR. 

3NB6910001290//KRAB box 

3NB691 0001 730//RI01/ZK632.3/MJ0444 family 
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3NB6920014330//Domain of unknown function 

ASTR020000950//SNAP-25 family 

BNGH41 0000030//R.norvegicus trg mRNA. 

BNGH41 0000290//SPRY domain 
5 BRACE20005250//DRR1 PROTEIN CTU3A PROTEIN). 

BRACE20005650//ATP synthase ab C terminal 

BRACE20013750//Hepatitis C virus non-structural protein NS4a 

BRACE20014770//HUNTINGTIN ASSOCIATED PROTEIN 1 (HAP1). 

BRACE20016730//MUS musculus mdgl-1 mRNA, complete cds. 
10 BRACE2001 7370//P.vivax pval gene. 

BRAGE20019440//Protein of unl<nown function DUF82 

BRAGE2DD2431D//P53-INDUCED PROTEIN 11. 

BRAGE20028960//MUS musculus mRNA for Ca2+ dependent activator protein for secretion, complete cds. 
BRACE20077840//Putative Protein that mediates attachment of autophagosomes to microtubules, by similarity to yeast 

'5 aut2 [Schizosaccharomyces pombe]. 

BRACE20093610//Bacterial type II secretion system protein 
BRAWH20003230//Proline rich protein 

BRAWH20009440//Arabidopsis thaliana pollenless3 (178) gene, complete cds; 
beta-9 tubulin (TUB9) gene, partial cds; and unknown gene. 
20 BRAWH2001 461 0//TS-N domain// UBA domain 
BRAWH20060440//PPR repeat 
BRAWH20076050//LORICRIN. 

CTONG20027210//VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN VPS13. 

CTONG20028030//Domain of unknown function DUF19// Ribosomal protein SI 8 
25 CTONG20064490//Drosophila melanogaster26S proteasome regulatory complex subunit p42A mRNA, complete cds. 

DFNES20004320//Homo sapiens ubiquitous TPR-motif protein Yisofonn (UTY) gene, partial cds; alternatively spliced. 

FCBBF10006870//MUS musculus Rap2 interacting protein 8 (RPIP8) mRNA, complete cds. 

FCBBF20002320//T-box 

FCBBF20002760//ALPHA SCRUIN. 
30 FCBBF20012110//Leishmanla major partial ppg1 gene for proteophosphoglycan. 

FCBBF20016720//Domain of unknown function DUF94 

FEBRA20000530//Drosophila melanogaster Diablo (dbo) mRNA, complete cds. 

FEBRA20005360//Homo sapiens paraneoplastic cancer-testis-brain antigen (MAS) mRNA, complete cds. 
FEBR/V20007570//Homo sapiens BI\/l-009 mRNA, complete cds. 
35 FEBRA20011330//26S PROTEASOME REGULATORY SUBUNIT S3 (PROTEASOME SUBUNIT P58). 
FEBRA20030540//Halocynthia roretzi mRNA for HrPET-1 , complete cds. 
FEBRA20044900//R.norvegicus mRNA for CPG2 pratein. 
FEBRA20048180//DRR1 PROTEIN (TU3A PROTEIN). 

FEBRA20053800//Homo sapiens ubiquitous TPR-motif protein Y isofonn (UTY) gene, partial cds; alternatively spliced. 

40 FEBRA20057260//TBC domain 
FEBRA20068730//Trg protein 

HCASM1 000021 0//Plasmodlum berghei strain NYU2 merozoite surface protein-1 mRNA, partial cds. 
HCASM20005360//Macrophage migration inhibitory factor 
H EART200041 1 0//POT family 
45 HEART20005680//Nerve growth factor family 

HHDPC20001150//MUS musculus putative secreted protein ZSIG37 (Zsig37) mRNA, complete cds. 
HHDPC20001490//MUS musculus partial mRNA for muscle protein 534 (mg534 gene). HHDPC20004560//2S seed 
storage family 

HHDPC20004620//FAD binding domain 
50 HSYRA1 00011 90//PROBABLE GYP7 PROTEIN (FRAGMENT). 

HSYRA1 0001 780//Alpha-2-macroglobulin family N-terminal region 

HSYRA20001350//CELL POLARITY PROTEIN TEA1 . 

HSYRA20014760//von Willebrand factor type A domain 

HSYRA2001 621 0//HesB-like domain 
55 IMR3220002230//HINT PROTEIN (PROTEIN KINASE C INHIBITOR 1) (PKCI-1) (17 KD (NHIBITOR OF PROTEIN 

KINASE C). 

IMR3220014910//Rattus norveglcus tricarboxylate carrier-like protein mRNA, complete cds. 
KIDNE10001520//MUS musculus yolk sac pemnease-like molecule 1 (YSPL-1) mRNA, complete cds. 
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KIDNE20003750//MUS musculus mRNA for granuphilin-a, complete cds. 

KIDNE20005740//Staphylococcus epidermidis putative cell-surface adhesin SdrF (sdrF) gene, complete cds. 
KIDNE20043440/A/acuolar protein sorting-associated protein - fission yeast 
KiDNE20056760//NEURONAL PROTEIN. 
5 KiDNE20060300//Gaiius gailus syndesmos mRNA, complete cds. 
KiDNE20062480//Scorpion short toxins 

KiDNE20067750//Homo sapiens PTOV1 (PTOV1) gene, complete cds. 

LiVER10000790//Rattus norvegicus fertility related protein WIVIPI mRNA, complete cds. 

iVlAiVlGL10000560//K-box region 
10 iVlESAN 10001 01 0//Rattrg gene product 

NB9N420004950//PROBABLE NUCLEAR ANTIGEN. 

NT2NE1 00001 80//SUPPRESSOR PROTEIN SRP40. 

NT2NE10000630//Gallus gailus Dach2 protein (Dacli2) mRNA, complete cds. 

NT2NE20007630//iVlatrix protein (iVIA), p15 
'5 NT2NE200i3370//Homo sapiens estrogen-responsive B box protein (EBBP) mRNA, complete cds. 

NT2NE20016970//iVISF1 PROTEIN. 

NT2NE20035690//Homo sapiens pliosphoinositoi 3-phosphate-binding protein-2 (PEPP2) mRNA, complete cds. 

NT2NE20053710//Ank repeat 

NT2Ri20006690//TRiCHOHYALiN. 
20 NT2RI20013420//MUS musculus cyclin ania-6b mRNA, partial cds. 

NT2RI20013850//Homo sapiens P381 P (P38iP) mRNA, complete cds. 

NT2RI20015190//Homo sapiens misato mRNA, partial cds. 

NT2RI20016210//Probabletransposase - human transposon MER37 

NT2RI20022700//X123 protein 
25 NT2RI200251 70//Homo sapiens PARS (PARS) mRNA, complete cds. 

NT2RI20029260//ARP2/3 COMPLEX 16 KDA SUBUNIT (P16-ARC). 

NT2Ri20029700//EF hand// EF hand 

NT2Ri20043040//Homo sapiens NY-REN-2 antigen mRNA, complete cds. 
NT2Ri20046060//K+ channel tetramerisation domain 
30 NT2Ri20061830//Proiine-rich protein Ml 4 precursor 

NT2RI20065060//Drosophiia meianogaster rudimentary gene, intron 3; anon-ISAB gene, complete cds. 
NT2RI20077230//Homo sapiens BR13 mRNA, complete cds, 

NT2RI20082210//CORNIFIN B (SMALL PROLINE-RICH PROTEIN IB) (SPR1B) (SPR1 B). 
NT2RI2008B120//AXONEME-ASSOCIATED PROTEIN MST101(2). 
35 NT2RI20091440//SPRY domain 

NT2RP600000B0//Homo sapiens Pig11 (PIG11) mRNA, complete cds. 

NT2RP60000720//Pinus taeda clone PtaAGP6 putative arabinogalactan protein mRNA, complete cds. 

NT2RP70009060//i\/ledicago truncatuia mRNA for 85p protein (85p gene). 

NT2RP7001 0800//iVlus musculus mRNA for Mill (IVliwi like), complete cds. 
40 NT2RP70022430//Tax1 -binding protein TRX - human. 

NT2RP70028290//Scm-reiated gene containing four mbt domains [Mus musculus]. 

NT2RP70033040//YceA protein homoiog ybfQ - Bacillus subtilis. 

NT2RP70036320//Microfilarial sheath protein 

NT2RP700S9600//Calpain inhibitor repeat 
45 NT2RP70042330//HYPOTHETICAL PROTEIN MJ0941 . 

NT2RP70049150//MUS musculus mRNA for UBE-1c1 , UBE-1c2, UBE-1c3, complete cds. 

NT2RP70052050//Human transfomnation-related protein mRNA, 3' end. 

NT2RP7008441 0//Polybromo 1 protein - chicken 

NTONG10000520//Rattus norvegicus mRNA for Kelch related protein 1 (krpi gene). 

50 NTONG10001230//iVlus msuculus mRNA, partial cds, clone CLFEST42. 
OCBBF10001220//RING CANAL PROTEIN (KELCH PROTEIN), 
OCBBF20010750//Spectrin repeat 

OCBBF20011400//VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN VPS8, 
OCBBF20014020//MUS musculus NSD1 protein mRNA, complete cds. 
55 PEBLM10001440//Trg 

PEBLM20002130//Mus musculus genes for integrin aM290, hapsin, partial and complete cds, 
PLACE50000370//Homo sapiens mRNA for hVPSII , complete cds. 
PLACE60004290//Gag P30 core shell protein 
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PLACE60021020//lntegrase Zinc binding domain// Integrase Zinc binding domain// DnaJ central domain (4 repeats) 

PLACE600241 90//TRICHOHYALIN. 

PLACE60032040//Hirudin 

PLACE60033990//SP1 DROIN 1 (DRAGL INE SILK FIBROIN 1) (FRAGMENT). 
5 PLACE6003B500//Homo sapiens mitochondrial solute carrier mRNA, complete cds. 

PLACE60043970//Takifugu rubrlpes retinitis pigmentosa GTPase regulator-lll<e protein gene, partial cds. 

PLACE60044640//Human placenta (Diff48) mRNA^ complete cds. 

PROST20023380//Cca3 protein 

PR0ST20034720//IMMEDIATE-EARLY PROTEIN. 
10 PR0ST20079740//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT). 

SALGL1 0000050//Permeases for cytosine/purines, uracil, thiamine, allantoin 

SALGL10000470//NG36 [Homo sappiens] 

SKMUS20002710//Hepatitis C virus capsid protein 

SKMUS20003650//Human (p23) mRNA, complete cds. 
15 SKMUS20004580//MUS musculus N-RAP mRNA, complete cds. 

SKMUS20009020//BR01 PROTEIN. 

SKMUS20009540//Homo sapiens F-box protein Fbx25 (FBX25) mRNA, partial cds. 
SKMUS2001 0080//MUS musculus mRNA for a skeletal muscle and cardiac protein. 
SKMUS20011470//MUS musculus RP42 mRNA, complete cds. 
20 SKMUS20013640//Laminin EGF-like (Domains III and V) 

SKMUS20015430//Homo sapiens HDCMC29P mRNA, partial cds. 
SKNMC20010570//F-box domain. 

SI\/IINT20001450//Halocynthia roretzi mRNA for HrPET-3, complete cds. 

SMINT20002270//Dlslntegrin// Trans-activation protein X 
25 SMINT20003960//A kinase anchor protein AKAP-KL isoform 2 

STOIVIA20002890//Adaptin N temiinal region 

SYNOV2000291 0//Arabinogalactan-Iike protein 

SYNOV20008200//rrichoplusia nitransposon IFP2 

TESTI10000250//M .musculus mRNA for testis-specific protein, DDC8. 
30 TEST1 1 0000640//Fugu rubrlpes sex comb on midleg-like 2 protein (SCML2) gene, complete cds. 

TEST1 10001 91 0//Homo sapiens 88-kDa Golgi protein (GM88) mRNA, complete cds. 

TESTI20000440//TRICHOHYALIN. 

TESTI20002070//NIFU-LIKE PROTEIN. 

TESTI20002080//Homo sapiens mRNA for Gab2, complete cds. 
35 TESTI20014120//TRICHOHYALIN. 

TESTI2001 6650//IMIVIEDIATE-EARLY PROTEIN. 

TESTI20022230//Ghlamydomonas reinhardtii strain 1132D- flagellar 

protofilament ribbon protein (RIB43a) mRNA, complete cds. 

TESTI20022940//IVIOB2 PROTEIN (MPS1 BINDER 2). 
40 TESTI20024610//TRICHOHYALIN. 

TESTI20030590//TESTIS-SPECIFIC PROTEIN PBS13. 

TESTI20030740//TRICHOHYALIN. 

TESTI20031300//TPR Domain 

TESTI20033560//F-box domain. 
45 TESTI20035510//Proliferating-cel nucleolar antigen P120-like protein - Archaeoglobus fulgidus. 

TESTI20035740//A-KINASE ANCHOR PROTEIN 150 (AKAP 150) (GAMP-DEPENDENT PROTEIN KINASE REGU- 
LATORY SUBUNIT II HIGH AFFINITY BINDING PROTEIN) (P150) (FRAGMENT). 

TESTI20038940//IQ calmodulin-binding motif// IQ calmodulin-binding motif// IQ calmodulin-binding motif 

TESTI20040310//Protein of unknown function DUF84 
50 TESTI20041220//Babesla bigemina 200 kDa antigen p200 mRNA, partial cds. 

TESTI20052680//Rattus norveglcus RSD-6 mRNA, complete cds. 

TESTI20054080//SERrrHR-RICH PROTEIN T10 IN DGCR REGION. 

TESTI20065720//PROTEIN D52 (N8 PROTEIN). 

TESTI20078670//RING CANAL PROTEIN (KELCH PROTEIN). 
55 TESTI20080330//Ribosomal protein L14p/L23e 

TESTI20083430//TPR Domain 

THYMU10000020//Homo sapiens mRNA for Golgi protein (GPP34 gene). 

THYMU1 000291 0//Homo sapiens AP-4 adaptor complex beta4subunit mRNA, complete cds. 
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THYMU20002360//Pumilio-family RNA binding domains (aka PUiVI-HD, Pumilio liomology domain) 
TRACH1 0000300//Anabaena PCC7120 hetC gene, complete cds. 

TRACH20007800//Homo sapiens PTH-responsive osteosarcoma B1 protein (B1) mRNA, complete cds. 
TRACH20008940//PROTEIN TSG24 (MEIOTIC CHECK POINT REGULATOR). 
TRACH20013950//Homo sapiens NY-REN-25 antigen mRNA^ partial cds. 
UMVEN10001220//Cortlcotropln-reieasing factor family 

[0249] With respect to the remaining 476 clones, there are so far no Information available for estimating their func- 
tions. However there Is the possibility that the functions of these clones will be revealed In future. Their Clone Names 
are indicated below. 

3NB6920013490, 3NB692001 6370, 3NB6920017190, ADRGL1 0001 820, ADRGL20004280, ASTRO20004800, 
BGG1 11 0002850, BNGH41 00001 30, BNGH41 00001 70, BNGH41 0000330, BNGH41 0001530, BNGH420004740, 
BRAGE1 0000200, BRACE1 0000700, BRACE1 0001 590, BRACE20000770, BRACE20001000, BRACE20001410, 
BRAGE20003320, BRACE20004210, 

BRAGE20005050, BRACE20005450, BRACE20009880, BRACE20010700, BRACE20011 880, BRACE20013740, 
BRACE20015430, BRAGE20016920, BRAGE2001B650, BRAGE200 18980, BRAGE20020500, BRACE20021510, 
BRAGE20021760, B RAG E20 024950, BRAGE20025900, BRACE20027520, BRAGE20028600, BRAGE20028610, 

BRAGE20032a5Q, BRACE200331 90, 

BRACE20033980, BRACE2003431 0, BRACE200351 60, BRACE20035270, BRACE20035390, BRACE20035940, 
BRAGE20071530, BRACE2007201 0, BRACE20072320, BRACE20075270, BRACE20075630, BRACE2007621 0, 
BRAGE20076460, BRACE20077080, BRACE20077270, BRACE20077670, BRAGE20077680, BRAGE20079020, 
BRAGE20081140, BRACE20084800, 

BRAGE200B4880, BRAGE20086530, BRACE200870B0, BRAGE20087540, BRAGE200B8570, BRACE20089990, 
BRAGE20090140, BRACE20092740, BRACE20092750, BRAGE20093110, BRAGE20094370, BRACE20095170, 
BRAWH1 0000070, BRAWH1 0001 740, BRAWH20000930, BRAWH20002480, BRAWH20005540, BRAWH20008660, 
BRAWH20008920, BRAWH20011030, 

BRAWH20047310, BRAWH20064930, BRAWH20069600, BRAWH20074060, BRAWH20089030, BRAWH20092270, 
BRAWH20094850, GTONG20003030, GTONG2000771 0, GTONG20008270, GTONG20020730, GTONG20021 430, 
CTONG20024530, GTONG20029650, DFNES20002920, FCBBF1 0006860, FCBBF1 000691 0, FCBBF1 0007320, 
FCBBF1 0007600, FCBBF20001050, 

FGBBF20001950, FCBBF20005760, FGBBF20006770, FGBBF20008080, FGBBF20012990, FGBBF2001 4800, 
FGBBF200171B0, FCBBF200 17200, FEBRA20003300, FEBRA20003910, FEBRA20006800, FEBRA20007400, 
FEBRA20007710, FEBRA20008740, FEBRA20009010, FEBRA20009590, FEBRA20011970, FEBRA20015900, 
FEBRA20015910, FEBRA20021940, 

FEBRA20027270, FEBRA20027B30, FEBRA20028820, FEBRA20028970, FEBRA20029080, FEBRA20033080, 
FEBRA20042240, FEBRA20042370, FEBRA20042930, FEBRA200441 20, FEBRA20044430, FEBRA20053770, 
FEBRA20054270, FEBRA20057520, FEBRA20059980, FEBRA20061500, FEBRA20063540, FEBRA20066270, 
FEBRA20074140, FEBRA20074580, 

FEBRA20076220, HCASM10001150, HCASM20005340, HLUNG1 0000300, HLUNG20003140, HLUNG200041 20, 
HLUNG20004800, HLUNG2000501 0, HSYRA1 0001 480, HSYRA20002480, HSYRA20002530, HSYRA20007600, 
HSYRA20011530, IMR321 0000740, IMR321 0000750, 1IVIR321 0001 650, IMR3220006090, IMR3220009350, 
IMR3220009530, IMR3220011850, 

IMR3220016000, IMR3220017240, KIDNE1 0000280, KIDNE1 0000500, KiDNE10001450, KIDNE20001 920, 
KIDN £20002440, KIDNE20002450, KIDNE20002660, KIDNE20033350, KIDNE20033770, KIDNE20037520, 
KIDNE20040340, KIDNE20040540, KIDNE20042940, KIDNE20045200, KIDNE20045340, KIDNE20045790, 
KIDNE20048640, KIDNE20048790, 

KIDNE20059370, KIDNE20070050, KIDNE20070770, KIDNE20073560, LIVER1 0000990, LIVER1 0002780, 
LIVER1 0003030, LIVER20004460, LIVER20005150, IVIAIVIGL1 0000350, IVIESAN20002670, l\/IESAN20003370, 
NB9N410001210, NB9N41 0001 460, NHNPG1 0001 010, NT2N El 0000040, NT2NE1 0001 200, NT2NE20000380, 
NT2NE20000560, NT2NE20000640, 

NT2NE20006360, NT2NE20007060, NT2NE20007870, NT2NE20008020, NT2NE20009800, NT2NE20011560, 
NT2NE200 13240, NT2NE20013640, NT2NE20014030, NT2NE200142B0, NT2NE20015300, NT2NE2001 6230, 
NT2NE200 16480, NT2NE20044900, NT2RI1 00001 60, NT2RI1 0001 640, NT2RI20000640, NT2RI20002700, 

NT2RI20002940, NT2RI2000671 0, 

NT2RI20007380, NT2Ri20008650, NT2RI2001 2350, NT2RI20012440, NT2RI200141 00, NT2RI2001 7260, 
NT2RI20026540, NT2Ri20028020, NT2RI20028520, NT2Ri200301 90, NT2RI20030670, NT2RI20033040, 
NT2RI20033380, NT2RI20035560, NT2RI20040590, NT2RI20043980, NT2RI20047830, NT2RI20048400, 
NT2RI20049160, NT2RI20049840, 

NT2RI20056280, NT2RI20061270, NT2RI20063450, NT2RI20064870, NT2RI20065530, NT2RI20066670, 
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NT2RI20067880, NT2RI20071160, NT2RI20072140, NT2RI20073860, NT2RI20075070, NT2RI20075720, 
NT2RI20075890, NT2RI20077290, NT2RI2007751 0, NT2RI20085260, NT2RI20086560, NT2RI20088010, 
NT2RI20090830, NT2RP60000170, 

NT2RP60000590, NT2RP7000041 0, NT2RP70003910, NT2RP70005790, NT2RP70013350, NT2RP70024490, 
5 NT2RP70025540, NT2RP7002841 0, NT2RP70030500, NT2RP7003091 0, NT2RP7004751 0, NT2RP70047660, 
NT2RP70049750, NT2RP700521 90, NT2RP70054680, NT2RP70054930, NT2RP70063740, NT2RP70066210, 

NT2RP70067010, NT2RP70069800, 

NT2RP70071140, NT2RP70073590, NT2RP70079300, NT2RP70081420, NT2RP70086230, NT2RP700921 50, 
NT2RP70092590, NT2RP70093630, NT2RP70093970, NT2RP70094660, NT2RP70095020, NT0NG1 0000330, 
10 NTONG20005830, NTONG20009850, NTONG20011370, NTONG20014280, OCBBF1 0000670, 0CBBF1 0000860, 
OCBBF1 0001 040, OCBBF20000130, 

OCBBF20001260, OCBBF20002870, OCBBF20009040, OCBBF20017060, PANCR1 0001 850, PEBLM1 0000290, 
PEBLM1 0001 800, PEBLM20000300, PEBLM20001260, PEBLM20001470, PLACE50001530, PLACE60000440, 
PLACE60000700, PLACE60000800, PLACE60001370, PLACE60002630, PLACE6000371 0, PLACE60004240, 

15 PLACE50005230, PLACE60005500, 

PLACE60009530, PLACE60012940, PLACE60019230, PLACE6001 9250, PLACE60026920, PLACE60029490, 
PLACE60030940, PLACE6G031 090, PLACE60033720, PLACE60037400, PLACE60040050, PLACE600431 20, 
PLACE60043360, PLACE6004491 0, PLACE60046870, PLACE6004931 0, PR0ST1 0001 520, PR0ST1 0002460, 
PR0ST1 0005640, PROST20002060, 

20 PROST20002670, PROST20 002740, PROST20004630, PROST20017390, PROST2001 7960, PROST2001 9980, 
PROST20021620, PROST20 025910, PROST20028420, PROST20031020, PROST20032100, PROST20033030, 
PROST20037320, PROST20044810, PROST20056040, PROST20061960, PUAEN1 0000810, SKMUS10001240, 
SKMUS20003430, SKMUS20004670, 

SKMUS20004680, SKMUS20008470, SKMUS20009450, SKMUS20015010, SKMUS20016080, SKMUS20016310, 
25 SKMUS20016710, SKNMC1 0000070, SKNMC1 00001 00, SKNMC1 0001 100, SKNMC1 0001 590, SKNMC1 0001 680, 
SKNMC1 0002640, SKNMC20003050, SKNMC20005930, SKNMC20006120, SKNMC20015550, SKNSH 10000860, 
SKNSH1 0003080, SKNSH20001510, 

SKNSH20001630, SMINT1 0000390, SMINT1 0000540, SMINT20000400, SMINT20002390, SMINT20005580, 
SPLEN1 0000490, SPLEN20000470, SPLEN20002420, SPLEN20004430, SPLEN20005410, ST0MA1 0000470, 
30 ST0MA1 0001 330, STOMA20001880, STOMA20004780, SYN0V1 0001 640, SYNOV20011440, SYNOV20014570, 
TEST1 10000230, TEST1 10001 250, 
TEST1 10001 630, TESTI1 0001 790, 
TESTI20005200, TESTI2000671 0, 
TESTI20010820, TESTI20013060, 
35 TESTI2001 8260, TESTI2001 8290, 
TESTI20018980, TESTI20019500, 
TESTI20022450, TESTI20022640, 
TESTI20028060, TESTI20028400, 
TESTI20033760, TESTI200341 80, 
40 TESTI20035410, TESTI20035800, 
TESTI20051550, TESTI20054920, 
TESTI20073580, TESTI20074020, 
TESTI20082400, TESTI20084400, 
THYMU 10000320, THYMU1 0001 050, THYMU1 0003660, THYMU1 0004730, THYMU1 0005270, THYMU20001400, 
45 TRACH1 00001 80, TRACH10000570, TRACH1 0001 060, TRACH20002350, TRACH20004610, TRACH20011920, 
TRACH20014000, UTERU20003380, UTERU20005410, UTERU20005690 

EXAMPLE 7 

50 Expression frequency analysis in silico 

[0250] The cDNA libraries derived from various tissues and cells as indicated in Example 1 were prepared, and 
cDNA clones were selected from each library at random. The 5'-end sequences were determined and the database 
was constructed based on the data. The database was constructed based on the nucleotide sequences of 770,546 
55 clones, and thus the population of the database is large enough for the analysis. 

[0251] Then, clones having a homologous sequence are categorized into a single cluster (clustering) by searching 
the nucleotide sequences of respective clones in this database with the program of nucleotide sequence homology 
search; the number of clones belonging to each cluster was determined and normalized for every library; thus, the 
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TESTI20000180, 
TESTI2000B190, 
TESTI20015930, 

TESTI20019680, 
TESTI20023610, 
TESTI20029650, 

TESTI20037270, 
TESTI20062380, 
TESTI20076130, 



TESTI20001790, 
TESTI20008300, 
TESTI20017580, 

TESTI20019910, 
TESTI20023690, 
TESTI20032550, 

TESTI20041110, 
TESTI20062550, 
TESTI20077500, 



TESTI20003720, 
TESTI20009510, 
TESTI20017660, 

TESTI20020480, 

TESTI20024150, 
TESTI20032800, 

TESTI20042430, 
TESTI20064250, 
TESTI20081390, 



TESTI20004620, 
TESTI20010080, 
TESTI20017920, 

TESTI20020900, 

TESTI20025440, 
TESTI20032990, 

TESTI20049290, 
TESTI20069790, 
TESTI20082340, 
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ratio of a certain gene in each cDNA library was deterniined. This anaiysis gave the information of the expression 
frequency of genes in tissues and celis which were sources of the cDNA libraries. 

[0252] Then, in order to anaiyze the expression of a gene containing the nucleotide sequence of the cDNA of the 
present Invention in tissues and ceils, the iibrary derived from a tissue or a cell used in the large-scale cDNA analysis 
5 was subjected to the comparison of the expression levels between tissues or cells Namely, the expression frequency 
was analyzed by comparing the previously nonnalized values between tissues and/or cells for which the nucleotide 
sequences of 600 or more cDNA clones had been analyzed. By this analysis, some of the genes were revealed to be 
involved in the pathology and functions indicated below. Each value in Tables 3 to 39 shown below represents a relative 
expression frequency; the higher the value, the higher the expression level. 

10 

Osteoporosis-related genes 

[0253] Osteoporosis is a pathology in which bones are easily broken owing to overall decrease in components of 
bone. The onset involves the balance between the functions of osteoblast producing bone and osteoclast absorbing 
15 bone, namely bone metabolism. Thus, the genes involved in the increase of osteoclasts differentiating from precursor 
cells of monocyte/macrophage line (Molecular i\^edicine 38. 642-648. (2001)) are genes involved in bone metabolism 

associated with osteoporosis. 

[0254] A nucleotide sequence information-based analysis was carried out to identify the genes whose expression 
frequencies are higher or lower in CD34-I- cell (cell expressing a glycoprotein CD34) treated with the osteoclast differ- 

20 entiation factor (IVIolecularlVledicine 3.8. 642-648. (2001)) than in the untreated CD34-I- cell, which is the precursor cell 
of monocyte/macrophage line. The result of comparative anaiysis for the frequency between the two cDNA libraries 
prepared from the RNA of CD34+ cells (CD34C) and from the RNA of CD34+ celis treated with the osteoclast differ- 
entiation factor (D30ST, D60ST or D90ST) showed that the genes whose expression levels were different between the 
two were the following clones (Table 3). 

25 KIDNE20062480, NT2Ri200 16570, PLACE60020840, 3NB6920002810, BRACE20035270, BRAWH20000340, 
FEBRA20062700, HSYRA20011 030, NT2RP70030910, OCBBF20011240, PLACE60043120, SYNOV20011440, 
HGASM10001150, IIVIR3220016000, NT2RI20082210, D3OST20001 840, FEBRA20012940, FEBRA20021 91 0, 
IMR3220002230, IIVIR3220012180, 

NT2RI20000640. NT2RI2001 091 0. NT2RI20058110, NT2RP60000350, NT2RP70011 660, PEBLIVI20003950, 
30 PLACE60049310, PROST20062600, TESTI20007840, TESTI2004031 0, TESTI20080200, THYMU1 0003590, 
TRACH1 0000630, TRACH20007800, CD34G2000051 0, HSYRA200 16210, KIDNE20004030, KiDNE20073280, 
NT2RP70055020, PLACE60043960, SKIVIUSI 0000220 
[0255] These genes are involved in osteoporosis. 

35 Genes involved in neural cell differentiation 

[0256] Genes involved in neural cell differentiation are useful for treating neurological diseases. Genes with varying 
expression levels in response to induction of cellular differentiation in neural cells are thought to be involved in neuro- 
logical diseases. 

40 [0257] A survey was performed for genes whose expression levels are varied in response to induction of differenti- 
ation (stimulation by retinoic acid (RA) or growth inhibitor treatment after RA stimulation) in cultured cells of a neural 
strain, NT2. The result of comparative anaiysis of cDNA libraries derived from undifferentiated NT2 cells (NT2RIVI) and 
the cells subjected to the differentiation treatment (NT2RP, NT2RI or NT2NE) showed that the genes whose expression 
levels were different between the two were the following clones (Table 4). 

45 SKNN/IC20000970, 3NB6920009120, BRAWH20006970, KiDNE20062480, NHNPG20002060, NT2NE20053710, 
NT2Ri20000640, NT2RI2000421 0, NT2Ri20006710, NT2RI20009740, NT2Ri20013420, NT2Ri20013850, 
NT2Ri20014100, NT2RI20025410, NT2Ri20033040, NT2RI20035560, NT2Ri20036950, NT2Ri20051500, 
NT2Ri20053350, NT2Ri20057230, 

NT2RI20071330, NT2RI20075720, NT2Ri20083960, NT2RI20087910, NT2Ri20090650, NT2Ri20094060, 

50 NT2RP60000350, NT2RP70000760, NT2RP70036800, NT2RP70071770, NT2RP70074220, TESTI20007840, 
TESTI20080200, 3NB692000281 0, 3NB6920005450, HSYRA2001 5740, HSYRA2001 621 0, IMR322001 6000, 
KIDNE20060140, NT2RI20014490, 

NT2RI2O015950, NT2RI20022520, NT2R 120025 170, NT2RI20025540, NT2RI2003051 0, NT2RI20040590, 
NT2RI20046060, NT2RI20053680, NT2RI20058510, NT2RI20066820, NT2RI20067030, NT2Ri20074980, 
55 NT2RI20075890, NT2RI20078840, NT2R 120084810, NT2Ri20089420, NT2RP70002380, NT2RP70023790, 
NT2RP70029820, NT2RP70049150, 

NT2RP70055020, NT2RP70065270, NT2RP70069860, NT2RP70075370, NT2RP70079750, NT2RP70092590, 
OCBBF20000130, PLACE60043970, TESTI20053960, BNGH420004740, HSYRA20002480, NT2NE1 0000730, 
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NT2NE20000560, NT2NE20003270, NT2NE20008090, NT2NE20014030, NT2RP60000720, 
NT2RP70004770, NT2RP70010800, 

NT2RP70011660, NT2RP70028750, NT2RP70029060, NT2RP70030550, NT2RP70032030, 
NT2RP70064900, NT2RP70093220, NT2RP70093730, SYNOV20013740, TESTI20021490, 
TRACH20007800, 3NB6920003300, BRACE1 0000200, BRACE2001 8550, FEBRA20008740, 
FEBRA20076220, KIDNE20073520, 

MAMGL1 0000320, NT2NE200021 40, NT2NE20006360, NT2NE20007870, NT2NE20009800, 
NT2RI20002940, NT2RI20014500, NT2RI20016210, NT2RI20029260, NT2RI20037510, 
NT2RI20064120, NT2RI20074390, NT2RI20077230, NT2RI20090660, PLACE60040050, 

3NB691 0001 730, BRACE10001150, 

BRACE20011170, BRACE20020910, BRACE20035270, BRAWH20005220, FEBRA20003970, 
HLUNG20003140, IMR3220009350, IMR3220013170, IMR3220013320, IMR3220014350, 
NT2NE1 00001 40, NT2NE1 0000180, NT2NE1 0000230, NT2N El 0000630, NT2NE1 0000830, 
NT2NE1 0001 630, NT2NE1 0001 850, 

NT2NE200003BO, NT2NE20000640, NT2NE20001740, NT2NE20002590, NT2NE20002990, 
NT2NE20003840, NT2NE20003920, NT2NE20004550, NT2NE20004700, NT2NE20005170, 
NT2NE20005500, NT2NE2Q005860, NT2NE20006580, NT2NE20007060, NT2NE20007630, 
NT2NE20011560; NT2NE2001 2470, 

NT2NE20013240, NT2NE2Q013370, NT2NE20013640, NT2NE2001 3720, NT2NE20014280, 
NT2NE20015300, NT2NE20016230, NT2NE20016260, NT2NE2001 6340, NT2NE20016480, 
NT2NE20016970, NT2NE20034080, NT2NE20044900, NT2NE200471 60, NT2NE2005441 0, 
NT2NE20057200, OCBBF20009040, 

OCBBF20015860, PLACE60020840, PR0ST1 0005260, SKMUS20008630, SMINT20003960, 
SYNOV20011440, TEST1 10000230, TESTI20009700, TESTI2004031 0, THYMU10003290, 
BGG1 12001 0970, BNGH41 0001 980, BRACE1 0001 660, BRACE200 14770, BRACE20034490, 
BRAWH20009440, BRAWH20036930, 

CTONG20020730, CTONG20028030, FCBBF1 0006750, FCBBF2001211 0, FCBBF20015380, 
FEBRA20043250, FEBRA20068730, HCASM10001150, HCASM20002140, HHDPC20000950, 
HSYRA1 0001 370, HSYRA1 0001 780, HSYRA20001350, HSYRA20006050, IMR321 0001 580, 
IMR3220003020, KIDNE20004030, 

KIDNE20073280, MESAN20005010, NT2R1 100001 60, 
NT2RI20003410, 



NT2R1 10000270, 
NT2RI20004120, 
NT2RI20010100, 



KIDNE20060300, 

NT2RI1 0001 640, NT2RI20002700, NT2RI20002820, 

NT2RI20006690, NT2RI20006B50, NT2RI200073B0, NT2RI20008650, 
NT2RI20010910, NT2RI20012350, 

NT2RI20012440, NT2RI20014090, NT2RI20015190, NT2RI20015400, NT2RI20016570, 

NT2RI2001B460, NT2RI20018660, NT2RI20020220, NT2RI20020410, NT2RI20021520, 

NT2RI20022700, NT2RI20025300, NT2RI20025850, NT2RI20026540, NT2RI20028020, 
NT2RI20029580, NT2RI20029700, 

NT2RI20030110, NT2RI200301 90, NT2RI20030670, NT2RI20031540, NT2RI20032050, 

NT2RI20033010, NT2RI20033380, NT2RI20033440, NT2RI20033830, NT2RI20036780, 

NT2RI20042840, NT2RI20043040, NT2RI20043980, NT2RI20044420, NT2RI20047830, 
NT2RI20049160, NT2RI20049840, 

NT2RI20049850, NT2RI2005061 0, NT2RI20050870, NT2RI20056280, NT2R 120056470, 

NT2RI20060710, NT2RI20060720, NT2RI20061270, NT2RI20061830, NT2RI200621 00, 

NT2RI20064870, NT2RI20065060, NT2RI20065530, NT2RI20066670, NT2RI20066790, 
NT2RI20067880, NT2RI20068250, 

NT2RI2006B550, NT2R 120070480, NT2RI20070840, NT2RI20070960, NT2RI20071160, 

NT2RI20072140, NT2RI20072540, NT2RI20073030, NT2RI20073840, NT2RI20073860, 

NT2RI20075070, NT2RI20077290, NT2RI20077510, NT2RI20077540, NT2RI20078270, 

NT2RI20078910, NT2RI20080500, 

NT2RI20081880, NT2RI2008221 0, NT2RI20083360, NT2RI20085260. NT2RI20085980, 

NT2RI20087140, NT2RI20087490, NT2RI2008B01 0, NT2RI200881 20, NT2RI20090830, 

NT2RI20092150, NT2RI20092890, NT0NG1 0001 820, OCBBF20002770, OCBBF20011240, 
PLACE50001130, PLACE6001 4430, 

PROST20029600, PUAEN 10000570, SALGL1 0001570, SKMUS1 0000220, SKMUS20004670, 

SYN0V1 0001 280, TESTI20012690, TESTI20023690, TESTI20028660, TESTI20068720, 

THYMU 10000830, TRACH20002370, 3NB691 0001 290, BRACE1 0000700, BRACE20003320, 
BRACE20079020, BRACE20083800, 



NT2RP60001090, 

NT2RP70036320, 

TRACH20004720, 
FEBRA20074580, 

NT2NE20035690, 
NT2RI20055640, 
TRACH20012890, 

FEBRA20012450, 
NT2NE1 0000040, 
NT2NE10001200, 

NT2NE20003690, 
NT2NE20005360, 
NT2NE20008020, 

NT2NE20014350, 
NT2NE20016660, 
NT2NE20055170, 

STOMA20001210, 
TRACH20013950, 
BRACE20071740, 

FEBRA20007570, 
HHDPC20004620, 
IMR3220002230, 

NT2RI100004B0, 
NT2RI20005970, 
NT2RI20010B30, 

NT2RI20017260, 
NT2RI20022430, 

NT2RI20028520, 

NT2RI20032220, 
NT2RI20041900, 
NT2RI20048400, 

NT2RI20058110, 
NT2RI20063450, 
NT2RI20067350, 

NT2RI20071480, 
NT2RI20074690, 
NT2RI20078790, 

NT2RI20086560, 
NT2RI20091440, 
PEBLM10001440, 

STOMA20002890, 
THYMU1 0000020, 
BRACE20015080, 
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BRACE20092740, FEBRA20008810, FEBRA20017150, FEBRA20067930, HHDPC20000550, HSYRA20008280, 
HSYRA20014760, KIDNE1 0001450, KIDNE20000850, KIDNE20002660, KIDNE20003300, KIDNE20033050, 
KIDNE20045340, NT2RP60000080, NT2RP60000170, NT2RP60000320, NT2RP60000390, NT2RP60000590, 
NT2RP60000860, NT2RP60001000, 
5 NT2RP60001230, NT2RP60001 270, NT2RP7000041 0, NT2RP70000690, NT2RP70002590, NT2RP7000271 0, 
NT2RP70003640, NT2RP7000391 0, NT2RP70004250, NT2RP70005790, NT2RP70006240, NT2RP70008120, 
NT2RP70009060, NT2RP7001231 0, NT2RP7001 3060, NT2RP7001 3350, NT2RP7001591 0, NT2RP7001 8560, 
NT2RP70021510, NT2RP70022430, 

NT2RP70023760, NT2RP70024490, NT2RP70024500, NT2RP70025540, NT2RP700261 90, NT2RP70028290, 
10 NT2RP7002a410, NT2RP70030500, NT2RP70030910, NT2RP70033040, NT2RP70036290, NT2RP70036470, 
NT2RP70039600, NT2RP70040800, NT2RP70042040, NT2RP70042330, NT2RP70042600, NT2RP70043730, 
NT2RP70043960, NT2RP70045410, 

NT2RP70046560, NT2RP70046870, NT2RP70047510, NT2RP70047660, NT2RP70047900, NT2RP70049250, 
NT2RP70049750, NT2RP70052050, NT2RP70052190, NT2RP70054680, NT2RP70054930, NT2RP700551 30, 
15 NT2RP70055200, NT2RP70061 620, NT2RP70061 880, NT2RP70062960, NT2RP70063040, NT2RP70063740, 
NT2RP70064080, NT2RP70066210, 

NT2RP70067010, NT2RP70069800, NT2RP70071140, NT2RP70071540, NT2RP7007221 0, NT2RP70072520, 
NT2RP70073590, NT2RP7007381 0, NT2RP70074060, NT2RP70075040, NT2RP70076100, NT2RP700761 70, 
NT2RP70076430, NT2RP70079250, NT2RP70079300, NT2RP70081330, NT2RP70081370, NT2RP70081420, 

20 NT2RP70081440, NT2RP70081670, 

NT2RP70083150, NT2RP70084060, NT2RP70084410, NT2RP70084870, NT2RP70085500, NT2RP70085570, 
NT2RP70086230, NT2RP70087200, NT2RP70088550, NT2RP70090120, NT2RP70090190, NT2RP70091490, 
NT2RP700916B0, NT2RP70092150, NT2RP70092360, NT2RP70093630, NT2RP70093700, NT2RP70093940, 
NT2RP70093970, NT2RP70094290, 

25 NT2RP70094660, NT2RP70094810, NT2RP70094980, NT2RP70095020, NT2RP70095070, NT0NG1 0000980, 
NT0NG1 00021 40, NTONG20002650, NTONG20016120, PEBLM20003950, PROST1 0005640, PROST20003250, 
SKNMC20000650, SKNSH1 0000860, SKNSH20003470, TEST1 10000510, TESTM 0000960, TESTI20015110, 
TESTI20074640, TRACH20004610 

[0258] These genes are neurological disease-related genes. 

30 

Cancer-related genes 

[0259] It has been assumed that, distinct from normal tissues, cancer tissues express a distinct set of genes, and 
thus the expression can contribute to the carcinogenesis in tissues and cells. Thus, the genes whose expression 
35 patterns in cancer tissues are different from those in normal tissues are cancer-related genes. Search was carried out 
for the genes whose expression levels in cancer tissues were different from those In nomnai tissues. 
[0260] The result of comparative analysis of cDNA libraries derived from breast tumor (TBAES) and normal breast 
(BEAST) showed that the genes whose expression levels were different between the two were the following clones 
(Tables). 

40 3NB691 0001 730, FCBBF1 0007600, KIDNE20033050, KIDNE20060300, NT2RI20065530, NT2RP60000720, 
NT2RP70075370, TRACH20004200, LIVER1 0000670, LIVER1 0005420, LIVER20000370 

[0261 ] The result of comparative analysis of cDNA libraries derived cervical tumor (TCERX) and normal cervical duct 
(CERVX) showed that the genes whose expression levels were different between the two were the following clones 
(Table 6). 

45 BRACE1 0001 590, HHDPC20000950, HSYRA20016210, NT2RI20074980, 3NB6920014330, NT2RI20087490, 
NT2RP60001090, PROST1 0002200, SKNMC20003220, STOMA20001210 

[0262] The result of comparative analysis of cDNA libraries derived fram colon tumor (TCOLN) and normal colon 
(COLON) showed that the genes whose expression levels were different between the two were the following clones 
(Table 7). 

50 BRACE20028610, BRACE20011170, BRACE20035940, IMR3220013320, NT2NE20053710 

[0263] The result of comparative analysis of cDNA libraries derived from esophageal tumor (TESOP) and normal 
esophagus (NESOP) showed that the genes whose expression levels were different between the two were the following 

clones (Table 8). 

KIDNE20005740, MAMGL1 0000320, NES0P1 0000870, NT2RI20056470, NTONG20008000 
55 [0264] The result of comparative analysis of cDNA libraries derived from kidney tumor (TKIDN) and normal kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 9). 

3NB6920002810, ADRGL1 0000020, BNGH420004740, BRACE 10000200, BRACE1 0000420, BRACE1 0000730, 
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BRACE1 0001 590, BRACE20005650, BRACE20016730, BRACE20028120, BRACE20077980, BRACE20083800, 
BRACE200B3850, BRAWH1 0001 740, BRAWH20036930, BRAWH20064500, BRAWH20064930, CTONG20028030, 
FCBBF20015380, FEBRA20005360, 

FEBRA20007570, FEBRA20008740, FEBRA20012270, FEBRA20025250, HSYRA20002480, HSYRA20006400, 
HSYRA20008280, HSYRA20015740, HSYRA20016210, IMR3220009350, LIVER10001110, NT2NE20003920, 
NT2NE20007630, NT2NE20007870, NT2RI2002541 0, NT2RI20026540, NT2RI20029580, NT2RI20033380, 

NT2RI20033830, NT2RI20051500, 

NT2RI20058110, NT2RI20090650, NT2RP60000720, NT2RP7001 3350, NT2RP70023790, NT2RP70024490, 
NT2RP70028750, NT2RP70029060, NT2RP7Q036a00, NT2RP70075370, NT2RP700761 00, NT0NG1 0000980, 
NT0NG1 0002460, NTONG20015500, OCBBF2000231 0, OCBBF20013070, PEBLM20001470, PEBLM20003950, 
PLACE60021510, PLACE60040050, 

PLACE60043970, PROST20051430, STOMA20001210, STOMA20002570, STOMA20002890, SYNOV20011440, 
TEST1 10000230, TESTI20009700, TESTI20021490, TESTI20032B00, TESTI20053960, TESTI20080200, 
TESTI20082400, BGGI120010970, BRACE2000421 0, BRACE20005250, BRACE20011170, BRACE2002091 0, 
BRACE20080970, BRAWH20000340, 

BRAWH20006970, BRAWH20011660, FCBBF20001950, FEBRA20043250, HLUNG1 0000640, IMR3220007420, 
IMR3220014350, KIDNE1 0000080, KIDNE1 0000280, KIDNE1 0000500, KIDNE1 0001 040, KIDNE10001430, 
KIDNE10001450, KIDNE10001520, KIDNE2000041 0, KIDNE2000051 0, KIDNE20000700, KIDNE20000850, 
KIDNE20001670, KIDNE20001 920, 

KIDNE20002440, KIDNE20002450, KIDNE20002660, KIDNE20003150, KIDNE20003300, KIDNE20003490, 
KIDNE20003750, KIDNE20004030, KIDNE20004220, KIDNE20004970, KIDNE20005130, KIDNE200051 70, 
KIDNE20005190, KIDNE20005740, KIDNE20031 850, KIDNE20033050, KIDNE20033350, KIDNE20033570, 
KIDNE20033730, KIDNE20033770, 

KIDNE20037520, KIDNE20039410, KIDNE20039940, KIDNE20040340, KIDNE20040540, KIDNE20040840, 
KIDNE20042620, KIDNE20042940, KIDNE20042950, KIDNE20043440, KIDNE20044110, KIDNE20045200, 
KIDNE20045340, KIDNE20045790, KIDNE20046810, KIDNE20048280, KIDNE20048640, KIDNE20048790, 
KIDNE20049810, KIDNE20050420, 

KIDNE20052960, KIDNE20a53360, KIDNE20054000, KIDNE20054770, KIDNE20056290, KIDNE20056760, 
KIDNE2Q059080, KIDNE20059370, KIDNE20Q601 40, KIDNE20060300, KIDNE2006D530, KIDNE20060620, 
KIDNE20061490, KIDNE20062990, KIDNE20063530, KIDNE20063760, KIDNE20066520, KIDNE20067600, 
KIDNE20067750, KIDNE20068800, 

KIDNE20070050, KIDNE20070770, KIDNE20071 860, KIDNE20073280, KIDNE20073520, KIDNE20073560, 
KIDNE20074220, KIDNE20075690, KIDNE2007B KIDNE20078110, LIVER1 0000790, MAMGL1 0000320, 
NB9N41 0000470, NT2NE20053710, NT2RI20006710, NT2RI20013420, NT2RI20016570, NT2RI2001 8460, 
NT2RI20025540, NT2RI20040590, 

NT2RI20065530, NT2RI20087490, NT2RI20087910, NT2RP60000350, NT2RP60001230, NT2RP70043730, 
NT2RP70069860, NT2RP70074220, OCBBF20014940, PLACE60020840, PLAGE60043120, PR0ST1 0003430, 
SKNSH20001510, SMINT1 00001 60, SPLEN200Q0470, SPLEN20001 340, SPLEN20003570, STO MAI 0000470, 
TEST1 10000700, TESTI20027070, 

TESTI20040310, TRACH1 0000300, TRACH20000790, TRACH20002500, TRACH20007800 
[0265] The result of comparative analysis of cDNA libraries derived from livertumor (TLIVE) and normal liver (LIVER) 
showed that the genes whose expression levels were different between the two were the following clones (Table 10). 
FGBBF50002610, FEBRA20076220, KIDNE20033050, NT2NE20003840, KIDNE20062480, KIDNE20068800, 
LIVER1 0000580, LIVER1 0000670, LIVER10000790, LIVER1 0000990, LIVER10001040, LIVER10001110, 
LIVER1 0001 750, LIVER1 0002300, LIVER1 0002780, LIVER1 0003030, LIVER1 0004330, LIVER1 0005420, 
LIVER20000330, LIVER20004160, LIVER20004460, LIVER20005150, NT2NE20002140, NT2RI20030510, 
NT2RI20043040, NT2R 120090650, PROST1 0005640, PROST20032320, SALGL1 0001 570, SMINT1 00001 60, 
SPLEN20002420, TESTI20002530, TESTI20080200, THYMU1 0003590, TRACH20004720 
[0266] The result of comparative analysis of cDNA libraries derived from lung tumor (TLUNG) and normal lung 
(HLUNG) showed that the genes whose expression levels were different between the two were the following clones 
(Table 11). 

NT2RI20030110, BNGH41 0001 980, BRACE1 0000420, BRACE1 0001150, BRACE2001 4770, BRACE2001 8550, 
BRAWH20006970, BRAWH2001 461 0, FEBRA2000881 0, FEBRA2001 5840, FEBRA20044120, HHDPC20001490, 
HLUNG1 0000240, HLUNG1 0000300, HLUNG1 0000370, HLUNG1 0000640, HLUNG1 0000760, HLUNG1 0000990, 
HLUNG10001050, HLUNG1 0001 1 00, 

HLUNG20000680, HLUNG20001160, HLUNG20001250, HLUNG20001420, HLUNG20001 760, HLUNG20002550, 
HLUNG20003140, HLUNG20004120, HLUNG20004800, HLUNG20005010, HSYRA20014200, KIDNE20002660, 
KIDNE20033050, NT2NE20014350, NT2RI2001 6570, NT2RI20026540, NT2R 12005 1500, NT2RI20064120, 
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NT2RI20083960, NT2RI20085260, 

NT2RI20087490, NT2RP70009060, NT2RP70011660, NT2RP70029060, NT2RP70055020, NT2RP70074220, 
NT2RP70076100, NT0NG1 0002460, NTONG20008000, PLACE60043120, SKMUS20016340, SKNMC20005930, 
SMINT20000180, SMINT20002390, SMINT20002770, SMINT20003960, STOMA1 0000470, STOMA20001 880, 
5 SYNOV20013740, TESTI20036250, TESTI20080200, TRACH2000461 0 

[0267] The result of comparative analysis of cDNA libraries derived from ovary tumor (TOVER) and normal ovary 
(NOVER) showed the genes whose expression levels were different between the two were the following clones (Table 
12). 

BRACE20011880, TESTI20030710, BRACE20076210, NT2RI20053680, SKMUS20008630, TESTI20005910, 

10 TESTI20040310 

[0268] The result of comparative analysis of cDNA libraries derived from stomach tumor (TSTOM) and normal stom- 
ach (STOIVI A) showed that the genes whose expression levels were different between the two were the following clones 
(Table 13). 

HSYRA20011030, NT2RI20013420, NT2RP70079750, BRACE20003320, HEART20005060, HHDPC20000950, 
15 HLUNG20004120, HLUNG200Q5Q1 0, HSYRA20006400, KIDNE1 0000500. KIDNE20062480, NT2NE2005371 0, 
NT2NE20054410, NT2RI20015400, NT2RI2001 6570, NT2RI200641 20, NT2RI20070840, NT2RI20071330, 
NT2RI200749aO, NT2RI20077230, NT2RI2Q0a9420, NT2RP70000760, NT2RP7002B750, PLACE600 14430, 
PLACE60024190, SKNMC20000970, ST0MA1 0000470, STOMA1 0000520, ST0MA1 0001 170, ST0MA1 0001 330, 
ST0MA1 0001 860, STOMA20000320. STOMA20000880, STOMA20001 21 0, STOIVIA20001 880, STOMA20002570, 
20 STOMA20002890, STOMA20003960, STOMA20004780, STOMA20004820, 
THYMU 10003590 

[0269] The result of comparative analysis of cDNA libraries derived from uterine tumor (TUTER) and normal uterus 
(UTERU) showed that the genes whose expression levels were different between the two were the following clones 
(Table 14). 

25 NT2RI20085260, 3NB692000281 0, BRACE1 0000420, BRACE20089990, BRACE20092120, BRAWH1 0001 680, 
BRAWH20011410, BRAWH20011660, FCBBF20005910, FCBBF50002610, FEBRA20005360, FEBRA20006800, 
FEBRA20008800, FEBRA20044120, FEBRA20057520, HEART20005060, HHDPG20000950, HLUNG1 0000760, 
HLUNG20003140, HSYRA20014200, 

HSYRA2001476Q, HSYRA20015800, IMR321 0002420, IMR3220002230, IMR3220009350, IMR3220014350, 
30 IMR3220016000, KIDNE20000850, KIDNE20060140, KIDNE20060300, MAMGL1 0000350, NT2NE20035690, 
NT2NE20053710, NT2RI1 0000270, NT2RI20000640, NT2RI20002940, NT2RI2001 091 0, NT2RI2001 3420, 
NT2RI20016570, NT2RI20033380, 

NT2RI20036950, NT2R 120037510, NT2RI20053350, NT2RI20057230, NT2RI20058110, NT2RI20071480, 
NT2RI20074980, NT2RI20084810, NT2RI20087490, NT2RI2008791 0, NT2RP60000350, NT2RP70032030, 
35 NT2RP70043730, NT0NG1 0000980, NT0NG1 0002460, PLACE60014430, PLACE60026680, PLAGE60043960, 
PLACE60044910, PLACE60047380, 

PR0ST1 0002200, PR0ST1 0005260, PROST20025910, PROST20033380, PUAEN1 0000570, SALGL10001570, 
SKMUS10000140, SKMUS20003430, SKMUS20009540, SKNMC1 000251 0, SKNMC20000970, SKNSH1 0000860, 
SMINT20002770, STOMA20002890, SYNOV20011440, TESTI1 0000230, TESTI2001 8290, TESTI20021490, 
40 TESTI20080200, TESTI20082400, 

TRAGH1 0000300, TRAGH20002370, TRAGH20007800, TRACH200 12890, UTERU1 0000770, UTERU1 0000960, 
UTERU1 0001 600, UTERU1 0001 920, UTERU20O004 UTERU20003380, UTERU20003930, UTERU20004850, 
UTERU20005410, UTERU20005690 

[0270] The result of comparative analysis of cDNA libraries derived from tongue cancer (CTONG) and normal tongue 
45 (NTONG) showed that the genes whose expression levels were different between the two were the following clones 

(Table 15). 

3NB691 0001 160, 3NB691 0001 290, 3NB691 0001 730, BNGH420004740, BRACE20008850, BRACE20020910, 
BRACE20074010, BRAWH200 14840, BRAWH20089560, CTONG20003030, CTONG20005890, CTONG20007710, 
GTONG20008270, GTONG20011390, GTONG200 13200, CTONG20013660, CTONG20015330, GTONG20018200, 

50 CTONG20019110, CTONG20019550, CTONG20020730, CTONG20021430, CTONG200241 80, CTONG20024530, 
GTONG20025580, CTONG2002721 0, CTONG20028030, CTONG200281 60, CTONG20028200, CTONG20029650, 
CTONG20037820, CTONG200471 60, CTONG20055530, CTONG20064490, FEBRA20003770, FEBRA20004520, 
FEBRA20007400, FEBRA20007570, FEBRA20012940, FEBRA20021 940, FEBRA200441 20, HCASM10001150, 
HHDPC20004560, HLUNG20003140, HSYRA20002480, IMR3220009350, IMR322001 21 80, KIDNE20000850, 

55 KIDNE20002660, KIDNE20004220, KIDNE20005740, KIDNE20056760, KIDNE20060140, KIDNE20062480, 
MESAN20000920, MESAN20003370, NHNPC20002060, NT2NE1 0001 850, NT2NE20000560, NT2NE200021 40, 
NT2NE20003270, NT2NE20003840, NT2NE20014350, NT2NE2005371 0, NT2RI20006690, NT2RI20006710, 
NT2RI20016570, NT2RI20018660, NT2RI20025300, NT2RI20025410, NT2RI200301 90, NT2RI20030510, 
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NT2RI20036950, NT2RI20046060, NT2RI20053350, NT2RI20067350, NT2RI20075720, NT2RI20078790, 
NT2RI20083960, NT2RI20087140, NT2RI20094060, NT2RP60000350, NT2RP60001230, NT2RP70000760, 
NT2RP70004770, NT2RP70009060, NT2RP70011 660, NT2RP70023760, NT2RP70023790, NT2RP70024500, 
NT2RP70026190, NT2RP70029820, NT2RP70036470, NT2RP70043730. NT2RP70061 880, NT2RP70071 770, 

5 NT2RP70076100, NT2RP70079750, NT2RP70084870, NT2RP 70093730, OCBBF20013070, PEBLM20003950, 
PLACE60037450, PLACE600431 20, PR0ST1 0003430, PR0ST1 0005260, PROST20032320, PROST20033020, 
PROST20056040, SKNMC1 000251 0, SKNMC20000650, SKNMC2001 0570, SKNSH20003470, SMINT200001 80, 
SYNOV20013740, TEST1 10000230, TEST1 10001 680, TESTI20007840, TESTI20021490, TESTI20022230, 
TESTI20023690, TESTI20030050, TESTI2Q042950, TESTI20068720, TESTI20080200, TRACH2001 2890, 

10 BRACE20006980, BRACE20092740, BRAWH20006970, FCBBF1 0007600, FEBRA20062700, IMR322001 6000, 
KIDNE20073280, MAMGL1 0000350, NT2NE20035690, NT2RI20056470, NT2RI20058110, NT2RI20084810, 
NT2RI20085260, NT2RP70015910, NT2RP70036290, NT2RP70036320, NT2RP70074220, NT2RP70075370, 
NT0NG1 0000330, NT0NG1 0000520, NTONG1 0001 230, NTONG1 0001 300, NT0NG1 0001 820, NT0NG1 00021 40, 
NT0NG1 0002460, NT0NG1 0002570, NTONG1 0002640, NTONG20002650, NTONG20003340, NTONG20003630, 

15 NTONG20004920, NTONG20005830, NTONG20008000, NTONG20008780, NTONG20009660, NTONG20009850, 
NTONG20011370, NTONG20012220, NTONG20014280, NTONG20015500, NTONG2001 61 20, OCBBF20011240, 
OCBBF20015a6G, PR0ST1 0002200, SKMUS20016340, SKNMC20000970, STOMA20004820, SYNOV10001280, 
SYNOV20011440, THYMU1 0000830, TRACH20000790, TRACH20009260 
[0271] These genes are involved in cancers. 

20 [0272] Further, there is a method to search for genes involved In development and differentiation: the expression 
frequency analysis in which the expression levels of genes are compared between developing or differentiating tissues 
and/or cells and adult tissues and/or cells. The genes Involved In tissue development and/or differentiation are genes 
participating in tissue construction and expression of function, and thus are useful genes, which are available for re- 
generative medicine aiming at convenient regeneration of Injured tissues. 

25 [0273] Search was carried out for the genes whose expression frequencies were different between developing and/ 
or differentiating tissues and/or cells, and adult tissues and/or cells, by using the Information of gene expression fre- 
quency based on the database of the nucleotide sequences of 770,546 clones shown above. 
[0274] The result of comparative analysis of cDNA libraries derived from fetal brain (FCBBF, FEBRAorOCBBF) and 
adult brain (BRACE, BRALZ, BRAIVIY BRAWH, BRCAN, BRCOC, BRHIP, BRSSN, BRSTN or BRTHA) showed that 

30 the genes whose expression levels were different between the two were the following clones (Tables 1 6 to 36), 

BRACE20028960, BRACE2007401 0, BRACE20077080, BRACE20077980, BRACE200B3B00, BRACE20088570, 
BRAWH1 0000010, BRAWH 1 0000020, BRAWH 1 0000070, BRAWH1 0000370, BRAWH 1 0000940, BRAWH1 0001 300, 
BRAWH1 0001 640, BRAWH1 0001 680, BRAWH1 0001 740, BRAWH1 0001 800, BRAWH20000340, BRAWH20000340, 
BRAWH20000480, BRAWH20000930, 

35 BRAWH20001 770, BRAWH20002480, BRAWH20003230, BRAWH20004430, BRAWH20004760, BRAWH20005030, 
BRAWH20005540, BRAWH20006330, BRAWH2000651 0, BRAWH20006970, BRAWH20008660, BRAWH20008920, 
BRAWH20009010, BRAWH20009440, BRAWH20009840, BRAWH20011030, BRAWH20011290, BRAWH20011660, 
BRAWH20012030, BRAWH20014180, 

BRAWH20014380, BRAWH20014610, BRAWH20015030, BRAWH20036B90, BRAWH20038320, BRAWH2004731 0, 
40 BRAWH20059980, BRAWH20060440, BRAWH20064930, BRAWH20066220, BRAWH20069600, BRAWH20069890, 
BRAWH20074060, BRAWH20076050, BRAWH20089560, BRAWH20092270, BRAWH2009261 0, BRAWH20093600, 
BRAWH20094850, iiVIR3220013170, 

KiDNE200a0850, KIDNE20004220, KIDNE20031 850, KIDNE20050420, MAMGL1 0000350, NT2NE20001 740, 
NT2RI20042840, NT2RI20086560, NT2RP70002590, NT2RP70065270, NT2RP70074220, NT0NG1 0001 820, 
45 PEBLM20001470, PLAGE60032040, SKMUS1 0000140, SMINT20005450, TESTI20004350, TESTI20008830, 
TRACH20007800, TRACH20016070, 

UMVEN20001330, 3NB691 0001 730, 3NB6920002810, ADRGL20000740, BNGH41 0001 370, BNGH41 0001 980, 
BRAGE1 0000200, BRACE1 0000730, BRACE1 0000930, BRACE20000770, BRAGE20001000, BRACE20001410, 
BRACE20002800, BRACE20003320, BRACE20005050, BRACE20005250, BRAGE20005450, BRACE20005650, 

50 BRACE20005650, BRACE20005770, 

BRAGE20006980, BRACE200071 80, BRAGE20008850, BRACE20009880, BRACE2001 0650, BRACE2001 0700, 
BRACE20011170, BRACE20011430, BRACE20011430, BRACE2001 1 880, BRACE20013400, BRAGE2001 3520, 
BRACE20013740, BRACE20013750, BRACE2001 4230, BRACE20014530, BRAGE2001 4550, BRAGE2001 4770, 
BRACE20014920, BRAGE2001 5080, 

55 BRACE20015430, BRACE2001 6730, BRACE2001 6920, BRAGE2001 7370, BRACE2001 8550, BRAGE2001 8590, 
BRACE20018650, BRAGE20018980, BRAGE20021510, BRAGE20021 760, BRAGE20022020, BRAGE20022270, 
BRAGE20024090, BRACE20024090, BRACE2002431 0, BRAGE20024680, BRAGE20024950, BRACE20025900, 
BRAGE20026350, BRAGE20026850, 
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BRACE20027360, BRACE20027520, BRACE20027550, BRACE20027720, BRACE20027920, BRACE20027960, 
BRACE20028120, BRACE20028600, BRACE20030780, BRACE20032850, BRACE20033190, BRACE20033980, 
BRACE20034310, BRACE20035160, BRACE20035940, BRACE20071380, BRACE20071530, BRACE20071970, 
BRACE20072010, BRACE20072320, 

BRACE20072810, BRACE20074470, BRACE20075020, BRACE20075270, BRACE20075380, BRACE20075630, 
BRACE20076210, BRACE20076460, BRACE20076630, BRACE20076850, BRACE2007761 0, BRACE20077640, 
BRACE20077670, BRACE20077840, BRACE20078680, BRACE20079020, BRACE20079530, BRACE20080970, 
BRACE20081140, BRACE20083850, 

BRACE20084430, BRACE20084880, BRACE20086530, BRACE20086550, BRACE20087080, BRACE20087540, 
BRACE20089600, BRACE20089990, BRACE20090140, BRACE20091880, BRACE20092120, BRACE20092750, 
BRACE20093070, BRACE20093110, BRACE20094370, CTONG20008270, CTONG20013200, CTONG20020730, 
CTONG20064490, HHDPC20000950, 

HHDPC20001150, HHDPC20004560, HSYRA1 0001 780, HSYRA20008280, HSYRA20011530, IMR32 10002660, 
IMR3220003020, IMR3220009350, KIDNE20003300, KIDNE20004970, KIDNE20005170, KIDNE20059370, 
KIDNE20058800, KIDNE20073280, LIVER20000370, MESAN20002670, NT2NE20005170, NT2NE20011560, 
NT2NE20013640, NT2NE20016970, 

NT2RI20006710, NT2RI20009740, NT2RI20022430, NT2RI20025300, NT2RI20028020, NT2RI20029260, 
NT2RI20030110, NT2RI2003051 0, NT2RI20040590, NT2RI20046060, NT2RI20049840, NT2RI20049850, 
NT2RI20056470, NT2RI20060720, NT2RI200621 00, NT2RI20067350, NT2RI20068250, NT2RI20070840, 
NT2RI20070960, NT2RI20071480, 

NT2RI20072540, NT2RI20074980, NT2RI20085260, NT2RI20088120, NT2RI20090660, NT2RI20090830, 
NT2RP70013060, NT2RP70013350, NT2RP70023760, NT2RP70024500, NT2RP7003091 0, NT2RP70036320, 
NT2RP70036470, NT2RP70042330, NT2RP70054930, NT2RP70064900, NT2RP70071140, NT2RP70075370, 
NT2RP70076100, NT2RP70079750, 

NT2RP70081370, NT2RP70090120, NT2RP70091 490, NT2RP70093730, NTONG20014280, NTONG20015500, 
PEBLM1 0000340, PLACE60014430, PLACE60a20840, PLACE60024190, PLACE60026920, PLACE60030380, 
PLACE60038500, PLACE60043970, PR0ST1 0002720, PROST20000530, PROST20021 620, PROST20032320, 

PROST20033380, PROST20062600, 

SALGL1 0000050, SALGL1 0001 570, SKMUS1 0000220, SKMUS200011 70, SKMUS2000271 0, SKMUS20009540, 
SKMUS20011290, SKMUS20015010, SKMUS20015430, SKMUS20016340, SKNMC20002240, SKNMC2001 5030, 
SKNSH20001510, SMINT1 0001 000, SMINT20002390, SPLEN20001970, STOMA20001210, STOMA20002570, 
SYNOV20002910, SYNOV20011440, 

TESTI10000510, TESTI1 0000700, TESTI1 0001 680, TESTI20005200, TESTI20015110, TESTI2001 8290, 
TESTI20018690, TESTI20018980, TESTI20024670, TESTI20032800, TESTI20033250, TESTI20036250, 
TESTI201 36910, THYMU1 0000830, THYMU1 0003290, THYMU1 0003590, UTERU1 0000960, UTERU20005690, 
ADRGL1 0000650, BGG1 12001 0970, 

BRACE20004210, BRACE20020500, BRACE20020910, BRACE20024780, BRACE20028610, BRACE20031100, 
BRACE20035270, BRACE20035390, BRACE20071 740, BRACE20077270, BRAWH20001 090, CTONG20024530, 
CTONG20028200, CTONG20055530, FCBBF1 0005980, FCBBF1 00061 80, FCBBF1 0006870, FCBBF1 0006910, 

FCBBF1 0007320, FCBBF1 0007600, 

FCBBF20000940, FGBBF20001 050, FCBBF20001950, FCBBF20002320, FCBBF20002760, FCBBF2000576 
FCBBF20005910, FCBBF20006770, FCBBF20007330, FCBBF2000B0B0, FCBBF20008150, FCBBF20009400, 
FGBBF20009510, FCBBF20012110, FCBBF20012990, FCBBF20014800, FGBBF20016720, FCBBF2001 71 80, 
FCBBF20017200, FCBBF40002820, 

HGASM 10001 150, HHDPC20001490, HLUNG1 0000640, HLUNG20003140, HLUNG20005010, HSYRA20001350, 
HSYRA20014760, HSYRA20016310, IMR3220007420, IMR3220009730, IMR3220009840, IMR3220012180, 
IMR3220013320, KIDNE20002660, KIDNE20056760, KIDNE20073520, LIVER20004160, MESAN20000920, 
NT2NE200 15300, NT2NE20035690, 

NT2RI20010910, NT2RI20016210, NT2RI20016570, NT2RI20033040, NT2R 120033440, NT2RI20058110, 
NT2RI20065060, NT2RI20087490, NT2RP60000720, NT2RP70002710, NT2RP7001 231 0, NT2RP70036800, 
NT2RP70055020, NT2RP70055130, NT2RP70061 880, NT2RP7008441 0, PLAGE60037450, PLAGE6004931 0, 
PR0ST1 0005260, PROST2001 8230, 

PROST20051430, SKMUS20000740, SKMUS20011470, SKNMG20003560, SKNSH20001630, SPLEN1 0000490, 
STOMA20002890, SYNOV2001 3740, TESTI20011410, TESTI20033760, TESTI20074640, TRACH1 0000300, 
TRAGH20013950, 3NB6920003300, 3NB6920009120, ADRGL1 00001 80, BRACE1 0001 150, BRACE10001590, 
BRACE1 0001 690, BRACE20077680, 

BRAGE20092740, BRACE2009361 0, BRACE20095170, BRAWH20011410, BRAWH20036930, BRAWH20064500, 
BRAWH20087060, CTONG20019110, FEBRA20000350, FEBRA20000530, FEBRA20001050, FEBRA20001290, 
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FEBRA20003110, FEBRA20003300, 
FEBRA20004040, FEBRA20004150, 
FEBRA20004540, FEBRA2000491 0, 

FEBRA20007400, FEBRA2000771 0, 
5 FEBRA20008800, FEBRA2000881 0, 

FEBRA20011460, FEBRA20012270, 

FEBRA20012940, FEBRA2001351 0, 

FEBRA20017900, FEBRA2001 9890, 

FEBRA20027830, FEBRA20028a20, 
10 FEBRA20033080, FEBRA20034290, 

FEBRA20037070, FEBRA20041100, 

FEBRA20043290, FEBRA20044430, 

FEBRA20050790, FEBRA20052160, 

FEBRA20057520, FEBRA20059980, 
15 FEBRA20060920, FEBRA20061500, 

FEBRA20066670, FEBRA20067360, 

FEBRA20072000, FEBRA2Q072aOQ, 

HSYRA20015800, IMR3220002230; 

KIDNE20005740, KIDNE2Q05336Q, NT2NE20002140, NT2NE20003270, NT2NE20003840, NT2NE20007870, 
20 NT2NE20047160, NT2NE2005371 0, NT2RI2002541 0, NT2RI20051500, NT2RI20056640, NT2RI2005851 0, 

NT2RI20061830, NT2RI20064120, NT2RI20071330, NT2RI20075890, NT2RI20077230, NT2RI20094060, 

NT2RP70002380, NT2RP70009060, 

NT2RP70015910, NT2RP70094810, NT2RP70094980, PLACE60012810, PLACE60040050, SKMUS20008630, 
SKNMC20003050. SKNSH20003470, TESTI20003560, TESTI20012690, TESTI2003071 0, TESTI20082400, 
25 TRACH20009260, 3NB691 0001 160, 3NB6920015280, BRACE1 0000700, BRACE20019440, BRAWH20052250, 
KIDNE20045340, NT2NE20002590, 

NT2NE20014030, NT2RI20020220, NT2RI20026540, NT2RI20060710, NT2RI20083960, NT2RI20084810, 
NT2RP70011660^ NT2RP70021 51 0, NT2RP70024490, NT2RP700261 90, NT2RP70039600, NT2RP70049250, 
NT2RP70071770, NT2RP70093940, NT0NG1 0002640, NTONG20002650, 0CBBF1 0000420, 0CBBF1 0000670, 

30 OCBBF1 0000860, 0CBBF1 000091 0, 

OCBBFi 0001 040, OCBBF10001180, OCBBF10001190, 0CBBF1 0001 220, OCBBF20002770, OCBBF20002870, 
OCBBF20007190, OCBBF20008240, OCBBF20009980, OCBBF20010750, OCBBF20011 01 0, OCBBF200i1400, 
OCBBF20011760, OCBBF20014080, OCBBF20014940, OCBBF20015270, OCBBF20015280, OCBBF20015860, 
OCBBF20017060, PLACE60043960, 

35 SMINT20002770, TESTI20001790, TESTI20007840, TESTI20009700, TESTI20027070, TESTI20053960, 
TRACH20000790, 3NB6920010220, BRACE1 0001 870, BRAWH20014840, BRAWH20040950, FEBRA20011970, 
KIDNE20062480, NT2RI20029580, NT2RI20035560, NT2RI20043980, NT2RP70000760, NT2RP70042040, 
NT2RP70069860, NT2RP70088550, 

OCBBF20001260, TESTI1 0000230, 3NB692001 71 90, ADRGL1 0000020, BRACE1 0001 660, BRAWH1 0001 620, 
40 CTONG20028030, KIDNE20004030, KIDNE20060300, NB9N420000420, NT2NE20000560, NT2NE20004700, 
NT2NE20007630, NT2RI20004120, NT2RI20013420, NT2RI200333B0, NT2RI20036950, NT2RI20053350, 
NT2RI20053680, NT2RI20078840, 

NT2RI20083360, NT2RI20090650, NT2RP60001 090, NT2RP70004770, NT2RP70023790, NT2RP70055200, 
NT2RP70085570, NT0NG1 0000980, NTONG20016120, OCBBF20011240, OCBBF20012100, OCBBF20013070, 
45 OCBBF20014020, PEBLM20003950, PLACE50001130, PLACE60021510, PUAEN1 0000570, SKNMC20000970, 
TESTI20040310, TRACH20004610, 

3NB6920005450, BRACE1 0000420, BRACE20076410, BRACE20078820, BRAWH20006860, BRAWH20089030, 
FGBBF100O67 FCBBF1 0006860, FCBBF200 15380, FCBBF50002610, FEBRA20004520, FEBRA20005360, 
FEBRA20009010, FEBRA20014920, FEBRA20015840, FEBRA20021910, FEBRA20021940, FEBRA20043250, 

50 FEBRA20057780, FEBRA20057880, 

FEBRA20066270, FEBRA20074580, HHDPC20000550, HSYRA2001 5740. HSYRA2001 621 0, IMR32 10002420, 
IMR3220016000, KIDNE200601 40, MAMGL1 0000320, NT2NE20008090, NT2NE20014350, NT2RI20000640, 
NT2RI20002940, NT2RI20015400, NT2RI20033830, NT2RI2003751 0, NT2RI20057230, NT2RI2008791 0, 
NT2RI20089420, NT2RP70043730, 

55 NT2RP70047900, PLAGE60043120, PROST20033020, SYN0V1 0001280, TESTI20021490, THYMU20002360, 
TRACH20012890, 3NB6910001290, BNGH420004740, BRACE20034490, BRAWH20005220, FEBRA20003770, 
FEBRA20007570, FEBRA20008740, FEBRA20012450, FEBRA20012450, FEBRA20017150, FEBRA20025250, 
FEBRA20044120, FEBRA20076220, 



FEBRA20003780, FEBRA20003910, FEBRA20003970, FEBRA20003990, 

FEBRA20006560, FEBRA20006800, FEBRA20006900, FEBRA20007330, 

FEBRA20007720, FEBRA20007870, FEBRA20008090, FEBRA20008560, 

FEBRA20Q09590, FEBRA20009720, FEBRA2001 0930, FEBRA20011330, 

FEBRA20014870, FEBRA20015900, FEBRA2001591 0, FEBRA2001 7060, 

FEBRA20020860, FEBRA20024290, FEBRA20024420, FEBRA20027270, 

FEBRA20028970, FEBRA20029080, FEBRA20030540, FEBRA20031550, 

FEBRA20041910, FEBRA20042240, FEBRA20042370, FEBRA20042930, 

FEBRA20044900, FEBRA20045920, FEBRA20048180, FEBRA20050140, 

FEBRA20053770, FEBRA20053800, FEBRA20054270, FEBRA20057260, 

FEBRA20062700, FEBRA200631 50, FEBRA20063540, FEBRA20064760, 

FEBRA20067930, FEBRA20068730, FEBRA20069420, FEBRA200701 70, 

FEBRA20074140, FEBRA2007551 0, FEBRA20075660, HSYRA20006400, 
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HSYRA20002480, IMR3220014350, MESAN20001490, NHNPC20002060, NT2NE1 00001 80, NT2NE20003920, 
NT2RI20006690, NT2RI20013B50, NT2RI20014500, NT2RI20025540, NT2RI20033010, NT2RI20075720, 
NT2RP60000350, NT2RP60001230, NT2RP70028750, NT2RP70029060, NT2RP70032030, NTONG20003340, 
OCBBF20000130, OCBBF20002310, 

OCBBF20009040, SKNMC20000650, SMINT20003960, TESTI20026320, TESTI20080200 

[0275] The result of comparative analysis of cDNA libraries derived from fetal heart (FEHRT) and aduit heart (HEART) 
showed that the genes whose expression levels were different between the two were the following clones (Table 37). 
KIDNE20062480, NT2RI20033040, NT2RP60000350, BGGI1 2001 0970^ BRACE1 0000420, BRACE10001150, 
BRACE20003320, BRACE20077980, BRAWH1 0000370, BRAWH20000340, BRAWH20011 660, BRAWH20014840, 
FEBRA20008740, FEBRA20072800, HEART20000350, HEART20000990, HEART20003090, HEART200041 1 0, 
HEART20004480, HEART20004920, 

HEART20005060, HEART20005200, HEART20005680, HHDPC20001150, HLUNG2000501 0, HSYRA20014200, 
IMR3220013170, KIDNE20004970, NT2R 120000640, NT2RI20006710, NT2RI20015400, NT2RI20026540, 
NT2RI20037510, NT2RI20057230, NT2RI20064120, NT2RI20071330, NT2RI20071480, NT2RI20077540, 
NT2RI20084810, NT2RI20087910, 

NT2RP70000760, NT2RP70024500, NT2RP70029060, NT0NG1 0001 820, PLACE60012810, PLACE600431 20, 
PROST2000053Q, SKMUS1 0000640, SKIVIUS2000458O SKMUS2001 501 0, SMINT20002770, TESTI20033250, 
TESTI20074640, UIVIVEN20001330 

[0276] The result of comparative analysis of cDNA libraries derived from fetal l<idney (FEKID) and adult kidney 
(KIDNE) showed that the genes whose expression levels were different between the two were the following clones 
(Table 38). 

3NB6920003300, 3NB6920009120, BGGI120010970, BRACE2000421 0, BRACE20005250, BRACE20011170, 
BRACE20020910, BRACE20026850, BRACE20080970, BRAWH20000340, BRAWH20006970, BRAWH20011660, 
FCBBF20001950, FEBRA20021940, FEBRA20043250, HLUNG1 0000640, IMR3220007420, IMR3220014350, 
KIDN El 0000280, KIDNE1 0000500, 

KIDNE1 0001 040, KIDNE10001430, KIDNE1 0001 450, KIDNE10001520, KIDNE20000410, KIDNE20000510, 
KIDNE20000700, KIDNE20000850, KIDNE20001 670, KIDNE20001920, KIDNE20002440, KIDNE20002450, 
KIDNE20002660, KIDNE200031 50, KIDNE20003300, KIDNE20003490, KIDNE20003750, KIDNE20004030, 
KIDNE20004220, KIDNE20004970, 

KIDNE20005130, KIDNE200051 70, KIDNE20005190, KIDNE20005740, KIDNE20031 850, KIDNE20033050, 
KIDNE20033350, KIDNE20033570, KIDNE20033730, KIDNE20033770, KIDNE20037520, KIDNE20039410, 
KIDNE20039940, KIDNE20040340, KIDNE20040540, KIDNE20040840, KIDNE20042620, KIDNE20042940, 
KIDNE20042950, KIDNE20043440, 

KIDNE20045200, KIDNE20045340, KIDNE20045790, KIDNE20046810, KIDNE20048280, KIDNE20048640, 
KIDNE20048790, KIDNE20049810, KIDNE20050420, KIDNE20052960, KIDNE20053360, KIDNE20054000, 
KIDNE20054770, KIDNE20056290, KIDNE20056760, KIDNE20059080, KIDNE20059370, KIDNE200601 40, 

KIDNE20060300, KIDNE20060530, 

KIDNE20060620, KIDNE20061490, KIDNE20062480, KIDNE20062990, KIDNE20063530, KIDNE20063760, 
KIDNE20066520, KIDNE20067600, KIDNE20067750, KIDNE20068800, KIDNE20070050, KIDNE20070770, 
KIDNE20071860, KIDNE20073280, KIDNE20073520, KIDNE20073560, KIDNE20074220, KIDNE20075690, 
KIDNE20078100, KIDNE20078110, 

LIVER1 0000790, MAMGL1 0000320, NB9N41 0000470, NT2NE2005371 0, NT2RI2000671 0, NT2RI2001 3420, 
NT2RI20016570, NT2RI20018460, NT2RI20025540, NT2RI20040590, NT2RI20065530, NT2RI20087490, 
NT2RI20087910, NT2RP60000350, NT2RP60001230, NT2RP70043730, NT2RP70069860, NT2RP70074220, 
OCBBF20014940, PLACE60014430, 

PLACE60020840, PLACE600431 20, PROST1 0003430, SKNMC20000970, SKNSH2000151 0, SMINT1 00001 60, 
SMINT20003960, SPLEN20000470, SPLEN20001340, SPLEN20003570, ST0MA1 0000470, SYNOV10001280, 
TEST1 10000700, TESTI20027070, TESTI20040310, TRACH1 0000300, TRACH20000790, TRACH20002500, 
TRACH20007800, KIDNE1 0000080, 

KIDNE20044110, NT2RI20033040, NT2RI2003751 0, NT2RP70065270, TRACH20012890 

[0277] The result of comparative analysis of cDNA libraries derived from fetal lung (FELNG) and adult lung (HLUNG) 
showed that the genes whose expression levels were different between the two were the following clones (Table 39). 
BNGH41 0001 980, BRACE1 0000420, BRACE10001150, BFiACE20014770, BRACE2001 8550, BRAWH20006970, 
BRAWH20014610, FEBRA2000881 0, FEBRA20015840, FEBRA20044120, HHDPC20001490, HLUNG1 0000240, 
HLUNG10000300, HLUNG1 0000370, HLUNG1 0000640, HLUNG1 0000760, HLUNG1 0000990, HLUNG1 0001 050, 
HLUNG1 0001 100, HLUNG20000680, 

HLUNG20001160, HLUNG20001250, HLUNG20001420, HLUNG20001760, HLUNG20002550, HLUNG20003140, 
HLUNG20004120, HLUNG20004800, HLUNG20005010, HSYRA200 14200, KIDNE20002660, KIDNE20033050, 
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NT2NE20014350, NT2RI20016570, NT2RI20026540, NT2RI20051500, NT2RI20064120, NT2RI20083960, 
NT2RI20085260, NT2RI20087490, 

NT2RP70009060, NT2RP70011660, NT2RP70029060, NT2RP70055020, NT2RP70074220, NT2RP700761 00, 
NT0NG1 0002460, NTONG20008000, PLACE60043120, SKMUS20016340, SKNMC20005930, SIVIINT200001 80, 
5 SMINT20002390, SMINT20002770, SMINT20003960, ST0MA1 0000470, STOMA20001880, SYNOV20013740, 
TESTI20036250, TESTI20080200, 
TRACH20004610, BRACE20004210, IMR3220007420 
[0278] These genes are involved in regeneration of tissues and/or ceiis. 

10 EXAiVIPLE 8 

Expression frequency anaiysis by PGR 

[0279] Specific PGR primers were prepared based on the full-length nucleotide sequences, and the expression fre- 
15 quency was analyzed by the ATAC-PCR method (Adaptor-tagged competitive PGR method: Nucleic Acids Research 
1997, 25(22): 4694-4696; "DNA Micro-array and Advanced PGR Techniques", Geil Technology, supplement, Eds., 
Muramatsu and Nawa (Shujunsha, 2000): 104-112). Inflammation-related genes can be identified by revealing the 
genes whose expression levels are altered depending on the presence of an Inflammation-Inducing factor. Then, by 
using THP-1 cell line, which Is a cell line of monocyte line, andTNF-a and LPS, both of which are inflammation-inducing 
20 factors, suitable for this system, the genes whose expression levels are altered depending on the presence of the 
factors were searched for by the system. 

[0280] THP-1 cell line (purchasedfromDAINIPPON PHARMAGEUTIGAL)wasculturedtobeconfluentlnRPMI1640 
medium (sigma) containing 5% fetal calf serum (GIBCO BRL). Then, the medium was changed with the medium con- 
taining 1 0 ng/ml TNF-a (human recombinant TNF-a; Phannacia Biotech) or 1 jig/mL LPS (Lipopolysaccharides; sigma), 
25 and the culture was continued at 37°C under 5% CO2. After three hours, the cells were harvested, and total RNA was 
extracted from them by using ISOGEN reagent (Nippon Gene). The extraction was carried out according to the method 
in the document attached to ISOGEN reagent. In addition, total RNA was also extracted from the cells cultured without 
stimulation of TNF-a or LPS. 

[0281 ] The genes involved in the onset of gastritis and gastroduodenal ulcer induced by the infection of Helicobacter 
30 py/orftothe epithelia of stomach can be identified by revealing the genes whose expression levels are altered depending 
on co-culturing the cells with Helicobacter pytori. Then, by using co-culture of a gastric cancer cell line with Helicobacter 
pylori, suitable for this system, the genes whose expression levels are altered depending on the presence of Helico- 
bacter pylori, were searched for by the system. 

[0282] A gastric cancer cell line MKN45 (provided by the Cell Bank, RIKEN GENE BANK, The Institute of Physical 
35 and Chemical Research) was cultured to be confluent in RPI\/II1640 medium (sigma) containing 10% fetal calf serum 
(GIBCO BRL). Then, the medium was changed with the medium containing 100-fold excess (in terms of the number 
of cells or the number of colonies) of Helicobacter pylori (TN2 strain: provided by Prof. Omata, Faculty of Medicine, 
The University of Tokyo), as compared with the number of the cancer cells. The culture was continued at 37°C under 
5% GO2. Afterthree hours, the cells were harvested, and total RNA was extracted from them by using ISOGEN reagent 
40 (Nippon Gene). The extraction was carried out according to the method in the document attached to ISOGEN reagent. 
In addition, total RNA was also extracted from the cells cultured without Helicobacter pylori. 

[0283] The analysis by the ATAC-PCR method was carried out basically according to "DNA Micro-array and Ad- 
vanced PGR Techniques", Cell Technology supplement (Genome Science Series 1, Eds., Muramatsu and Nawa (Shu- 
junsha, 2000): 104-112). Adapter ligation to the internal standard sample (sample to make the calibration curve for the 
45 clone of interest) and test sample was carried out In the two separate reaction systems indicated below. The combination 
of 6 types of adapters (AD-1 , AD-2, AD-3, AD-4, AD-5 and AD-6: see the sequences indicated below) and the samples 
are as follows. 

Reaction system A 

AD1; Internal standard, 10-fold 
AD2; THP-1 cells, unstimulated 
AD3; Internal standard, 3-fold 
AD4; THP-1 cells, TNF-a stimulation 
55 AD5; THP-1 cells, LPS stimulation 

AD6; Internal standard, 1-fold 

Reaction system B 
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AD1; internal standard, 1-fold 

AD2; MKN45 cells, unstimulated 

AD3; internal standard, 3-fold 

AD4; IVIKN45 cells, co-cultured with Helicobacter pylori 

ADS; internal standard, 10-fold 

Adapter sequences: 



ADl; 

SEQ ID NO: 3283//5 '-GTACATATTGTCGTTAGAACGCG-3 ' 

SEQ ID NO: 3284//3 ' -CATGTATAACAGCAATCTTGCGCCTAG-5 ' 
AD2 ; 

SEQ ID NO: 3285/ /5 ' -GTACATATTGTCGTTAGAACGCGACT-3 ' 

SEQ ID NO: 3286//3 '-CATGTATAACAGCAATCTTGCGCTGACTAG-5' 

AD3; 

SEQ ID NO: 3287//5 ' -GTACATATTGTCGTTAGAACGCGCATACT-3 ' 

SEQ ID NO: 3288//3 '-CATGTATAACAGCAATCTTGCGCGTATGACTAG-5 ' 
AD4; 

SEQ ID NO: 3289//5 '-GTACATATTGTCGTTAGAACGCGATCCATACT-3 ' 

SEQ ID NO: 3290//3 '-CATGTATAACAGCAATCTTGCGCTAGGTATGACTAG-5 ' 
ADS; 

SEQ ID NO: 3291//5'-GTACATATTGTCGTTAGAACGCGTCAATCCATACT-3 ' 

SEQ ID NO: 3292//3'-CATGTATAACAGCAATCTTGCGCAGTTAGGTATGACTAG-5' 

AD6; 

SEQ ID NO: 3293//5'-GTACATATTGTCGTTAGAACGCGTACTCAATCCATACT-3' 

SEQ ID NO: 3294//3 '-CATGTATAACAGCAATCTTGCGCATGAGTTAGGTATGACTAG- 



[0284] The internal standard sample used for this assay was a nnixture of total RNAs of THP-1 Control, IVIKN45 
Control, NT2 (Stratagene; catalog No, 204101). RNA preparation from the culture cells was carried out according to 
the standard method, 

[0285] The sequences of primers specific to the genes and the names of clones of interest in the analysis are as 
follows. The gene specific primers were designed to produce the PGR products of 70 to 200 bp, which are derived 
from the adapter-containing cDNA. The sequence of adapter-specific primer (labeled with fluorescence (FAIVI)) used 
In the competitive PGR was GTAGATATTGTCGTTAGAAGGC (22 nucleotides; SEQ ID NO: 3295). PGR was basically 
carried out with a cycling profile of preheating at 94°C for 3 minutes, and 30 cycles of denaturatlon at 94°G for 30 
seconds/annealing at 50°C for 60 seconds/extension at 72°C for 90 seconds; In some cases, merely the annealing 
temperature was changed. 

The nucleotide sequences of clone specific primers used In the experiments 

[0286] Clone name, primer sequence and SEQ ID NO are Indicated below In this order. Each Is demarcated by a 
double slash mark (//). 
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3NB6920000290//CTCCTCCAGCAGAACTTG//SEQ ID NO; 3296 


ADRGLl 0000180 / / TTT AG AGCTGMTCCCCATT/ / SEQ 


ID 


NO: 


3297 


BNGH4 1 0001370// T AAAAGCAGG AAATTGT AAA/ /SEQ 


ID 


NO: 


3298 


BRACE 10001590/ / ATATGGACAAAGGACCAATT / / SEQ 


ID 


NO: 


3299 


BRACE 10001690/ / AGGACTAGATTCACTGCTTA/ /SEQ 


ID 


NO; 


3300 


BRACE2 0 010650// CAACTCTCAACACCACAATC/ / SEQ 


ID 


NO: 


3301 


BRACE2 0 013400// CTACTCAAGGACAGCCACAC/ / SEQ 


ID 


NO: 


3302 


BRACE 20030780// AG AT AG AGGCTTGCTGGTGT / / SEQ 


ID 


NO: 


3303 


BRACE2 0034490// CCTTATGTCAAACTGCGATT/ / SEQ 


ID 


NO: 


3304 


BRACE2 0077640/ /TTTGCCTTATTCATTGGTTG/ / SEQ 


ID 


NO: 


3305 


BRACE 2 0 07 9530/ /GT AATATCACCCCACAGAGG/ / SEQ 


ID 


NO: 


3306 


BRACE 20083850// TATCATCTTTTGGGGCTTTG/ / SEQ 


ID 


NO: 


3307 


BRACE2 0 09188 0/ / AATAAGCCAGTTGCATCCTC/ / SEQ 


ID 


NO: 


3308 


BRAWHl 0001620// TCTCTC ATCTCCAAACATGC/ / SEQ 


ID 


NO: 


3309 


BRAWH2 0004430// TG AATTG AAAGAGAC ACACT / / SEQ 


ID 


NO: 


3310 


FCBBFl 0 006180// CTTT^TCCAGTTCATCAGCT/ / SEQ 


ID 


NO: 


3311 


FEBRA2 0003780/ / TTTTGAGACAGAGTTTCGCT/ / SEQ 


ID 


NO: 


3312 


FEBRA2 0006800/ / ATGTTTTACGATTGCCTTTG/ / SEQ 


ID 


NO: 


3313 


FEBRA2 0008810// GAAGC ATCTTTGGTGT ACTA/ /SEQ 


ID 


NO: 


3314 


FEBRA2 0012940// TGTCCCTGGAAAGT AATATA/ /SEQ 


ID 


NO: 


3315 


FEBRA2 00158 4 0/ / AACACAGTAGCCAGAACCAG/ / SEQ 


ID 


NO: 


3316 


HCASMl 0000610 / / AAGAGCCTACTACACGCCAG/ / SEQ 


ID 


NO: 


: 3317 


HEART2 0000350 / /TTTAAGAGCACACAGAAGTC/ / SEQ 


ID 


NO: 


3318 


HE ART2 0004480// ATTACTGGTGTGGAGTGGGT/ / SEQ 


ID 


NO: 


3319 


HEART2 0 005 06 0/ / ACTCTGCCTTCACTTTCCTT/ / SEQ 


ID 


NO: 


3320 


HHDPC20000950// GATAAAGGATAC AGCCAAAA/ /SEQ 


ID 


NO: 


3321 


HLUNG10000370//ATCATGGTCGTTACAGAATT//SEQ 


ID 


NO: 


3322 


HLUNG2 0001160/ / ACTGCCTTCAATCTCAGGTT/ / SEQ 


ID 


NO: 


3323 


HLUNG2 0 0 0 17 6 0/ / ATCACTGCCAATTTCACAAA/ / SEQ 


ID 


NO: 


3324 



HSyRA20003470//CCACCGAGTTCTGTTG//SEQ ID NO: 3325 
HSyRA20013320//GTCATGGCCACAGTTGTATC//SEQ ID NO: 3326 



55 



EP 1 293 569 A2 



lMR3210001580//GATAAAGGATACAGCCAiy5A//SEQ ID NO: 3327 
IMR3210002660//CCCAAAATGTGTATTATTCA//SEQ ID NO: 3328 
IMR3220008380//TTCGGCAATAATCTTCTCTT//SEQ ID NO: 3329 
IMR3220008590//CCACCAACACTTAGACATCA//SEQ ID NO: 3330 
KIDNE10001520//GAATTATAGGTGCACAACAC//SEQ ID NO: 3331 
KIDNE20000850//TCTTCTAGTGGAAGAGGTTTA//SEQ ID NO: 3332 
KIDNE20003490//TATCTGAARATGTGTTTGGT//SEQ ID NO: 3333 
KIDNE20005170//ACTCCTGGCTTTCTATTTCC//SEQ ID NO: 3334 
KIDNE20033730//GACAGTCTCGCTGTATCTCC//SEQ ID NO: 3335 
KIDNE20040540//ACATCCAGTACACCTTCTCC//SEQ ID NO: 3336 
KIDNE20050420//GTCGAAAGTGTTGCTCCTAG//SEQ ID NO: 3337 
KIDNE20061490//TCATAGCTGAGGGGTTAAGT//SEQ ID NO: 3338 
KIDNE20062990//ATAGCTCTTGTTTCAGTGTG//SEQ ID NO: 3339 
LIVER20000330//AAGCATGTGGGAGTTATTTA//SEQ ID NO: 3340 
NT2NE10001630//CTTGAGAGTCCAGGTTTCCT//SEQ ID NO: 3341 
NT2NE10001850//CCCATAAAGAATAGAAGCTC//SEQ ID NO: 3342 
NT2NE20003920//CTCATGGGGCTAAGTCTATT//SEQ ID NO: 3343 
NT2NE20005500//TCAAAGTCCAGGATAGCATT//SEQ ID NO: 3344 
NT2RI20009740//ACTGATTTGGTTCTGCGATT//SEQ ID NO: 3345 
NT2RI2 0014 500/ /CTTACTTCGAGTTCTAGCAC//SEQ ID NO: 3346 
NT2RI20016570//TGCTGCTCATGTTAAACTTG//SEQ ID NO: 3347 
NT2RI2 001 8660/ /AAACATCATCTCTTCCTTGG//SEQ ID NO: 3348 
NT2RI2002152Q//GCTGAAGAGAACAATAAGTC//SEQ ID NO: 3349 
NT2RI20050870//GACAGAGTAGTGGGGCATCT//SEQ ID NO: 3350 
NT2RI20053350//TTCAGCAGGTAGACAACATC//SEQ ID NO: 3351 
NT2RI20070480//CCTCTCTTTCAGTTGAGCAT//SEQ ID NO: 3352 
NT2RI20073030//GGGCTTGTTTTACGC//SEQ ID NO: 3353 
NT2RI20078270//CCTAGGCAGTAAC:ATGAAAA//SEQ ID NO: 3354 
NT2RI20078790//GCAGACAGGTACAGCTGAGT//SEQ ID NO: 3355 
NT2RI20083360//TTATTTTAGTTACCTTGGCA//SEQ ID NO: 335S 
NT2RP6 0000080/ /ACTGT7^TCTCCTTGCCTT//SEQ ID NO: 3357 
NT2RP60000390//GAGTTTGGGGACAGTCAAGT//SEQ ID NO: 3358 
NT2RP60000590//AAATGCAAAATTGCTGAGAT//SEQ ID NO: 3359 
NTONG10000980//TTCAGCAGGTAGACAACATC//SEQ ID NO: 3360 
NTONG10b02570//GTCGCTGAAATTTGCTTCTT//SEQ ID NO: 3361 
PLACE60020160//CCATATCCACTTTCATCATC//SEQ ID NO: 3362 
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P LACE 60026990// C AAG AAACTGAC AATC ACGG / / S E Q 


ID 


NO: 


3353 




P LACE 60047380// AAGG AGTTG ACATTTTGCT G / / SE Q 


ID 


NO: 


3364 


5 


PUAEN10003220//TTTTCAGAGGGCTTTGTGTT//SEQ 


ID 


NO: 


3365 




SKNMC 10000290 / / AT AACTG AACCC ATGGAAAG/ / SE Q 


ID 


NO: 


3366 




S KNMCl 0 0 0 1 5 9 0 / / ACATCCAGT ACACCTTCTCC / / SE Q 


ID 


NO: 


3367 


10 


SKNMC 2 0000650/ /GCACTAGGAGACTGTCAAAA/ / SEQ 


ID 


NO: 


3368 




STOMA2000257 0//GGTATCTTGGAGCTCCTCAG//SEQ 


ID 


NO: 


3369 




STOMA20002890//GTCAGCATCTACTCTGGGTC//SEQ 


ID 


NO: 


3370 


15 


S YN0V2 0001770/ / AAGAAATAAAC AC ACGAAAA/ /SEQ 


ID 


NO: 


3371 




TE ST I 1 0 0 00 2 3 0 / /AAATGCAAAATTGCTGAGAT / / SEQ 


ID 


NO: 


3372 




TESTI 100 00550 //CAGAACACTCCTCATACCTC// SEQ 


ID 


NO: 


3373 




TESTI20011340/ / AAAGTACAGCAGAAGATGGG/ /SEQ 


ID 


NO: 


3374 




THYMUl 0005580/ / AACAGCTTCTTC ATCACAGT / / SEQ 


ID 


NO: 


3375 




TRACHl 0000630/ / AT AGAGGAAGGTGGCAACTG/ /SEQ 


ID 


NO: 


.3376 




TRACH2 0001960 / /CTCTTTTCCATCACATTCCC/ /SEQ 


ID 


NO: 


3377 


25 


UMVENl 0001220/ /CCAAGTTCTCATTCCACATT/ / SEQ 


ID 


NO: 


3378 




UMVEN2 0001330/ / AGCT AAC AAGGTTTTG ACAC/ /SEQ 


ID 


NO: 


3379 




UTERa2 0004850/ / AGACTGGGTCTTGCCATACT/ / SEQ 


ID 


NO: 


3380 



30 

[02B7] The result of expression frequency analysis is shown in Table 40. The clones not shown in the table contain 
clones whose expression levels could not be measured because the levels were too low or the sizes of the PGR 
products were different fronn the expected. It was confirmed that the expression levels of TNF-a, IL-1, and IL-B genes 
used as positive control genes were elevated. 
35 [0288] The result obtained by the search for the genes whose expression levels were altered depending on the 
presence of TNF-a or LPS in culturing THP-1 cell, which is a human monocyte cell line, showed that the clones whose 
expression levels were elevated by twofold or more depending on the TNF-a stimulation (the clones whose expression 
levels were 0.1 or lower both before and after the stimulation were excluded), were 

ADRGL1 00001 80, BRACE20030780, BRACE20077640. BRACE20083850, BRAWH20004430, FCBBF1 00061 80, 

40 FEBRA20003780, FEBRA20006800, FEBRA20012940, FEBRA20015840, HEART20004480, HLUNG1 0000370, 
HLUNG20001160, HSYRA20013320, IMR3220008380, KIDNE1 0001 520, KIDNE20040540, KIDNE20061 490, 
KIDNE20062990, NT2NE1 0001 630, NT2NE20003920, NT2NE20005500, NT2RI2001 4500, NT2RI2001 6570, 
NT2RI20078270, NT2RI20083360, NTONG1 0002570, PUAEN 10003220, SKNIVIG1 0000290, STOMA20002570, 
TESTI20011340, UTERU20004850. 

45 [0289] Further, the clones whose expression levels were elevated by twofold or more depending on the LPS stimu- 
lation (the clones whose expression levels were 0.1 or lower both before and after the stimulation were excluded), 
were FCBBF1 00061 80, FEBRA2001 5840, HLUNG1 0000370, HLUNG20001160, HSYRA20013320, 
KIDNE20040540, KIDNE20061490, NT2NE1 0001 630, NT2NE20003920, NT2NE20005500, NT2RI20014500, 
NT2RI2001 6570, NT2RI20078270, NT0NG1 0002570, PUAEN1 0003220, STOIVIA20002570, TESTI2001 1 340. These 

50 genes whose expression levels were elevated by LPS stimulation, were all up-regulated by the TNF-a stimulation. 
[0290] On the other hand, with respect to the genes whose expression is suppressed, in particular cases where the 
expression levels were relatively high in the unstimulated cells (the relative value were 1 or higher), the clones whose 
expression levels were decreased by twofold or more by the TNF-a stimulation, were BRAGE20013400, 
BRACE20091880, HEART20005060, HLUNG20001 760, i[VlR3220008590, NT2NE1 0001 850, NT2RI2001 8660, 

55 NT2RI20053350, NT2RI20070480, PLACE60047380, STOiV1A20002B90, SYNOV20001 770, TRAGH20001 960. Fur- 
ther, when the levels were normalized by using the ratio of the expression level of p-actin widely used in data normal- 
ization for gene expression level, the clones whose expression levels were decreased by tenfold or more depending 
on the LPS stimulation, were BRACE20013400, BRACE20091880, HEART20005060, HLUNG20001760, 
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NT2RI20070480,UMVEN20001 330. Among the genes whose expression levels were decreased by TNF-a stimulation, 
the genes whose expression levels were also decreased by the LPS stimulation were BRACE20013400, 
BRACE20091880, HEART20005060, HLUNG20001760, NT2RI20070480. 

[0291] These clones were thus revealed to be involved in the inflammation reaction induced by TNF-a or LPS. 
[0292] The result obtained by the search for the genes whose expression levels were altered depending on co- 
culturlng gastric cancer cell line MKN45 with Helicobacter pylori, showed that the clones whose expression levels were 
elevated by twofold or more depending on the presence of Helicobacter pylori (the clones whose expression levels 
were 0.1 or lower both before and after the stimulation were excluded), were BRACE1 0001590, BRACE20079530, 
BRAWH1 0001 620, FEBRA20006800, KIDNE20003490, KIDNE20040540, KIDNE20050420, NT2NE1 0001 850, 
STOMA20002890, SYNOV20001770. TESTI1 0000550, UTERU20004850. Of the clones, FEBRA20006800, 
KIDNE20040540 and UTERU20004850 were also up-regulated by TNF-a stimulation in the human monocyte cell line 
THP-1. 

[0293] On the other hand, with respect to the genes whose expression is suppressed, in particular cases where the 
expression levels were relatively high in the unstimulated cells in (the relative value were 1 or higher), when the levels 
were normalized by using the ratio of the expression level of p-actin widely used in data normalization for gene ex- 
pression level, the clones whose expression levels were decreased by fivefold or more in the presence of Helicobacter 
pylori, were BRACE20034490, BRACE20077640, BRACE20083850, KIDNE20005170, LIVER20000330, 
NT2RP60000390, NT0NG1 0000980, UMVEN20001330. 
[0294] These clones are involved in gastritis or gastroduodenal ulcer. 

Table 3 



Clone ID 

KIDNE20062480 

NT2R1 2001 6570 

PLAGE60020840 

3NB6920002810 

BRAGE20035270 

BRAWH20000340 

FEBRA20062700 

HSYRA20011030 

NT2RP70030910 

OCBBF20011240 

PLACE60043120 

SYNOV20011440 

HCASM10001150 

IMR3220016000 

NT2RI200B2210 

D30ST20001840 

FEBRA20012940 

FEBRA20021910 

IMR3220002230 

IMR3220012180 

NT2R1 20000640 

NT2R1 2001 0910 

NT2R120058110 

NT2RP60000350 

NT2RP70011660 

PEBLM20003950 

PLACE60049310 

PROST2006260 

TEST1 20007840 

TEST1 20040310 

TESTI20080200 



CD34C D30ST D60ST D90ST 



7.035 
8.776 



10.991 
18.197 



46.971 
20.306 



14.226 
68.294 
66.053 



5.748 
8.24 
20.558 
4.74 
33.245 
40.521 
35.533 

9.617 
39.804 
44.145 
15.442 

15.55 
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Table 3 (continued) 



Clone ID 

THYM Uf 0003590" 
TRACH 10000630 
TRACH20007800 
CD34C20000510 
HSYRA20016210 
KIDNE20004030 
KIDNE20073280 
NT2RP70055020 
PLACE60043960 
SKMUS1 0000220 



54.663 
58.521 
58.654 
57.743 



34.338 
18.508 



Clone ID 



NT2RM NT2RP 



SKNMC20000970 

3NB6920009120 

BRAWH20006970 

KIDNE20062480 

NHNPC20002060 

NT2NE20053710 

NT2R1 20000640 

NT2R1 20004210 

NT2RI20006710 

NT2RI20009740 

NT2RI20013420 

NT2RI20013850 

NT2RI20014100 

NT2RI20025410 

NT2RI20033040 

NT2RI20035560 

NT2RI20036950 

NT2RI20051500 

NT2RI20053350 

NT2RI20057230 

NT2RI20071330 

NT2RI20075720 

NT2RI20083960 

NT2RI20087910 

NT2RI20090650 

NT2RI20094060 

NT2RP60000350 

NT2RP70000760 

NT2RP70036800 

NT2RP70071770 

NT2RP70074220 

TESTI20007840 

TESTI20080200 

3NB6920002810 

3NB6920005450 



2.312 
2.916 
4.004 
1.503 
1.674 
7.977 
1.53 

29.541 
2.481 

13.985 
4.074 
6.297 

53.957 

17.546 
3.342 

28.514 
4.691 
3.392 
2.329 
2.418 

13.599 
4.321 
8.389 
5.711 
2.967 

26.699 
5.37 
4.602 
6.556 
18.93 
4.004 

19.261 
1.375 
1.239 



0.784 
1.978 
2.717 
7.137 
1.136 
0.773 
1.038 

40.087 
1.683 
9.489 
1.382 
4.273 

18.305 

11.905 
3.401 
3.869 
9.549 
9.205 
1.581 
1.641 
9.227 
2.932 
5.692 
1.938 
6.039 

18.115 
4.554 
3.122 
8.897 
6.422 
2.717 
3.267 
1.867 
0.841 
4.884 
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Table 4 (continued) 


Clone ID 


NT2RIV1 


NT2RP 


NT2RI 


NT2NE 


HSYRA20015740 


° 








HSYRA20016210 




2353 


^277 




IMR3220016000 




1 458 


0 989 




KIDNE20060140 


0 


14.005 


9.502 


0 


NT2R1 2001 4490 


0 


29.632 


70.368 


0 


NT2R1 2001 5950 


° 






° 


NT2RI20022520 




59 577 


40423 




NT2RI200251 70 




59 577 


40 423 




NT2RI20025540 




5 634 


3 823 




NT2R1 20030510 


0 


15.012 


10.186 


0 


NT2RI20040590 


0 


13.827 


9.381 


0 


NT2RI20046060 


0 


9.661 


13.11 


0 


NT2RI20053680 


° 


3.443 




° 


NT2RI20058510 






16 726 




NT2RI20066820 


0 


32.943 


67.057 


0 


NT2RI20067030 


0 


59.577 


40.423 


0 


NT2RI20074980 


0 


6.002 


24.433 


0 


NT2RI20075890 


0 


12.002 


8.144 


0 


NT2RI20078a40 


0 


22.336 


7.578 


0 


NT2RI20084810 


0 


12.646 


12.87 


0 


NT2RI20089420 


0 


17.837 


18.153 


0 


NT2RP70002380 


0 


44.979 


15.259 


0 


NT2RP70023790 


0 




1 .549 


0 


NT2RP70029820 


0 


15 7^3 


10.661 


0 


NT2RP70049150 


0 


59.577 


40.423 


0 


NT2RP70055020 


0 


4.911 


3.332 


0 


NT2RP70065270 


0 


1.664 


1 .129 


0 


NT2RP70069860 


0 


10.864 


7.371 


0 


NT2RP70075370 


0 


7.229 


1.635 


0 


NT2RP70079750 


0 


3.104 


2.106 


0 


NT2RP70092590 


0 


17.381 


1 1 .793 


0 


OCBBF20000130 


0 


7.346 


19.936 


0 


PLACE60043970 


0 


12.827 


4.352 


0 


TESTI20053960 


0 


24.609 


5.566 


0 


BNGH420004740 


0 








HSYRA20002480 


0 


2 789 




2 867 


NT2NE1 0000730 




24 483 




75 517 


NT2NE20000560 


0 


4.458 


0 


4.584 


NT2NE20003270 


0 


28.131 


0 


14.462 


NT2NE20008090 


0 


34.048 


0 


17.504 


NT2NE20014030 


0 


17.216 


° 




NT2RP60000720 








^3 015 


NT2RP60001090 


0 


3.47 


0 


3.568 


NT2RP70004770 


0 


22.083 


0 


11.353 


NT2RP70010800 


0 


59.331 


0 


40.669 


NT2RP70011660 


0 


5.472 


0 


2.813 


NT2RP70028750 


0 


7.823 


0 


1.609 


NT2RP70029060 


0 


3.063 


0 


9.449 


NT2RP70030550 


0 


49.305 


0 


50.695 


NT2RP70032030 


0 


8.843 


0 


4.546 



116 



EP 1 293 569 A2 



Table 4 (continued) 



Clone ID 

"Nt2RP70036320 " 
NT2RP70064900 
NT2RP70093220 
NT2RP70093730 
SYNOV20013740 
TEST 120021490 
TRACH20004720 
TRACH20007800 
3NB6920003300 
BRACE1 0000200 
BRACE20018550 
FEBRA20008740 
FEBRA20074580 
FEBRA20076220 
KIDNE20073520 
MAMGL1 0000320 
NT2NE20002140 
NT2NE20006360 
NT2NE20007870 
NT2NE20009800 
NT2NE20035690 
NT2RI20002940 
NT2RI20014500 
NT2RI20016210 
NT2RI20029260 
NT2RI20037510 
NT2RI20055640 
NT2RI20064120 
NT2RI20074390 
NT2RI20077230 
NT2RI20090660 
PLACE60040050 
TRACH20012890 
3NB691 0001 730 
BRAG E 1000 11 50 
BRACE20011170 
BRACE20020910 
BRACE20035270 
BRAWH20005220 
FEBRA20003970 
FEBRA20012460 
HLUNG20003140 
IMR3220009350 
IMR3220013170 
IMR3220013320 
IMR3220014350 
NT2NE1 0000040 
NT2NE1 00001 40 
NT2NE1 00001 80 
NT2NE1 0000230 



32.803 
32.957 

29.95 
11.184 
20.062 



0.71 
18.865 
16.357 

2.493 
23.056 

6.122 

4.038 
0.39 

7.614 
18.047 

3.883 
39.756 
14.209 
12.546 

3.454 
21.272 
18.069 

3,361 
19,702 

2.899 
39.756 
14.643 

5,676 

3,883 

4,391 
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Table 4 (continued) 



Clone ID 

"Nt2NEf00b0630 " 
NT2NE1 0000830 
NT2NE1 0001 200 
NT2NE1 0001 630 
NT2NE1 0001 850 
NT2NE20000380 
NT2NE20000640 
NT2NE20001740 
NT2NE20002590 
NT2NE20002990 
NT2NE20003690 
NT2NE20003840 
NT2NE20003920 
NT2NE20004550 
NT2NE20004700 
NT2NE20005170 
NT2NE20005360 
NT2NE20005500 
NT2NE20005860 
NT2NE20006580 
NT2NE20007060 
NT2NE20007630 
NT2NE20008020 
NT2NE20011560 
NT2NE20012470 
NT2NE20013240 
NT2NE20013370 
NT2NE20013640 
NT2NE20013720 
NT2NE20014280 
NT2NE20014350 
NT2NE20015300 
NT2NE20016230 
NT2NE20016260 
NT2NE20016340 
NT2NE20016480 
NT2NE20016660 
NT2NE20016970 
NT2NE20034080 
NT2NE20044900 
NT2NE20047160 
NT2NE20054410 
NT2NE20055170 
NT2NE20057200 
OCBBF20009040 
OCBBF20015B60 
PLACE60020840 
PR0ST1 0005260 
SKMUS20008630 
SMINT20003960 
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Table 4 (continued) 



Clone ID 

"Sf6MA2000T2"lO" 
SYNOV20011440 
TEST1 10000230 
TESTI20009700 
TESTI2004Q310 
THYMU 10003290 
TRACH20013950 
BGGI120010970 
BNGH41 0001 980 
BRACE1 0001 660 
BRACE20014770 
BRACE20034490 
BRACE20071740 
BRAWH20009440 
BRAWH20036930 
CTONG20020730 
CTONG20028030 
FCBBF1 0006750 
FCBBF20012110 
FCBBF20015380 
FEBRA20007570 
FEBRA20043250 
FEBRA20068730 
HCASM1 0001 150 
HCASM20002140 
HHDPC20000950 
HHDPC20004620 
HSYRA1 0001 370 
HSYRA1 0001 780 
HSYRA20001350 
HSYRA20006050 
IMR321 0001 580 
IMR3220002230 
IMR3220003020 
KIDNE20004030 
KIDNE20060300 
KIDNE20073280 
MESAN20005010 
NT2RI1 0000 160 
NT2RI1 0000270 
NT2RI1 0000480 
NT2RI1 0001 640 
NT2RI20002700 
NT2RI20002820 
NT2RI20003410 
NT2RI20004120 
NT2RI20005970 
NT2RI20006690 
NT2RI20006850 
NT2RI20007380 



2.974 
20.55 



6.10B 
27.924 
11.189 
19.379 
25.686 
26.479 
26.462 
19.363 
3.16B 
1.649 
8.353 
6.35 
1,43 
2,71 
4,733 
25.354 
10.795 
7.211 
7.505 
17.911 
5.369 
4.566 
4.387 
3.815 
1.994 
6.225 
24.967 
100 
38.568 
55,06 



6.222 
29.216 
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Table 4 (continued) 



Clone ID 

"Nt2RI20008650"' 
NT2RI20010100 
NT2RI20010830 
NT2RI20010910 
NT2RI20012350 
NT2RI20012440 
NT2RI20014090 
NT2RI20015190 
NT2RI20015400 
NT2RI20016570 
NT2RI20017260 
NT2RI20018460 
NT2RI20018660 
NT2R1 20020220 
NT2RI20020410 
NT2RI20021520 
NT2R1 20022430 
NT2RI20022700 
NT2RI20025300 
NT2RI20025850 
NT2RI20026540 
NT2RI20028020 
NT2RI20028520 
NT2RI20029580 
NT2RI20029700 
NT2RI20030110 
NT2RI20030190 
NT2RI20030670 
NT2RI20031540 
NT2RI20032050 
NT2RI20032220 
NT2RI20033010 
NT2RI20033380 
NT2RI20033440 
NT2RI20033830 
NT2RI20036780 
NT2RI20041900 
NT2R1 20042840 
NT2RI20043040 
NT2RI20043980 
NT2RI20044420 
NT2RI20047830 
NT2RI20048400 
NT2RI20049160 
NT2RI20049840 
NT2RI20049850 
NT2RI20050610 
NT2RI20050870 
NT2RI20056280 
NT2R1 20056470 



100 
0.383 
1.461 
73 33 
14.383 
50.642 
39.883 
100 
14.03 
22.43 
100 
25.585 
25.193 
5.201 
39.201 
62.741 



2.725 
28,31 



6.678 
31.421 
21.125 



21.943 
28.031 
16.393 



51.095 
17.16 
34.993 
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Table 4 (continued) 



Clone ID 

"Nt2Rf20058lTo"" 
NT2RI20060710 
NT2R1 20060720 
NT2R1 20061 270 
NT2RI20061830 
NT2RI20062100 
NT2RI20063450 
NT2RI20064870 
NT2RI20065060 
NT2RI20065530 
NT2RI20066670 
NT2RI20066790 
NT2RI20067350 
NT2RI20067880 
NT2RI20068250 
NT2RI20068550 
NT2RI20070480 
NT2RI20070840 
NT2RI20070960 
NT2RI20071160 
NT2RI20071480 
NT2RI20072140 
NT2RI20072540 
NT2RI20073030 
NT2RI20073840 
NT2RI20073860 
NT2RI20074690 
NT2RI20075070 
NT2RI20077290 
NT2RI20077510 
NT2R1 20077540 
NT2RI20078270 
NT2R1 20078790 
NT2RI20078910 
NT2R1 20080500 
NT2R1 20081 880 
NT2RI20082210 
NT2R1 20083360 
NT2RI20085260 
NT2R1 20085980 
NT2RI20086560 
NT2RI20087140 
NT2R1 20087490 
NT2RI2008B010 
NT2RI2008B120 
NT2RI20090830 
NT2RI20091440 
NT2RI20092150 
NT2RI20092890 
NT0NG1 0001 820 



NT2RI 
"f2"65"5" 
18.232 
36.029 
100 



56.418 
2.339 



39.94 
34.304 



39.213 
39.208 
9.741 



100 
2. 823 
5.348 
3.491 
2,71 
13.947 
50.642 
1.932 
100 
15.273 
24.964 
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Table 4 (continued) 



Clone ID 

"OGBBF20602770" 
OCBBF20011240 
PEBLM1 0001 440 
PLACE50001130 
PLACE60014430 
PROST20029600 
PUAEN1 0000570 
SALGL1 0001 570 
SKMUS1 0000220 
SKMUS20004670 
STOMA20002890 
SYNOV10001280 
TEST1 2001 2690 
TESTI20023690 
TEST1 20028660 
TEST1 20068720 
THYMU 10000020 
THYMU 10000830 
TRACH20002370 
3NB6910001290 
BRACE1 0000700 
BRACE20003320 
BRACE20015080 
BRACE20079020 
BRACE20083800 
BRACE20092740 
FEBRA20008810 
FEBRA20017150 
FEBRA20067930 
HHDPC20000550 
HSYRA20008280 
HSYRA20014760 
KIDNE1 0001 450 
KIDNE20000850 
KIDNE20002660 
KIDNE20003300 
KIDNE20033050 
KIDNE20045340 
NT2RP60000080 
NT2RP60000170 
NT2RP60000320 
NT2RP60000390 
NT2RP60000590 
NT2RP60000860 
NT2RP60001000 
NT2RP60001230 
NT2RP60001270 
NT2RP70000410 
NT2RP70000690 
NT2RP70002590 



NT2RI 
"3"9"88"3" 
7.83 
18.541 
16.492 
3.227 
40.276 
8.916 
1.34 
3.288 
2534 
1.705 
4.53B 
3.275 
44.022 
10.313 
21.478 
27.219 
895 
13.11 



22.972 
14.937 
26.275 
59.759 

4.248 
42.609 

9.264 
30.227 
58.409 
14.432 

7.137 

5.313 
19.263 

4.841 

7.078 
20.763 

1.709 
17.723 
100 
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Table 4 (continued) 


Clone ID 


NT2RIV1 


NT2RP 


NT2RI 


NT2NE 


NT2RP70002710 


° 








NT2RP70003640 










NT2RP70003910 




100 






NT2RP70004250 


0 


100 


0 


0 


NT2RP70005790 


0 


100 


0 


0 


NT2RP70006240 


° 




° 


° 


NT2RP70008120 




100 






NT2RP70009060 




8617 






NT2RP70012310 




24 404 






NT2RP70013060 


0 


49.874 


0 


0 


NT2RP700 13350 


0 


30.221 


0 


0 


NT2RP70015910 


0 


22.4 


0 


0 


NT2RP70018560 










NT2RP70021510 


0 


49.439 


0 


0 


NT2RP70022430 










NT2RP70023760 


0 


17.87 


0 


0 


NT2RP70024490 


0 


22.701 


0 


0 


NT2RP70024500 


0 


19.57 


0 


0 


NT2RP70025540 


0 


100 


0 


0 


NT2RP70026190 


0 


9.482 


0 


0 


NT2RP70028290 


0 


49.972 


0 


0 


NT2RP70028410 


0 


100 


0 


0 


NT2RP70030500 


0 


100 


0 


0 


NT2RP70030910 


0 


20.809 


0 


0 


NT2RP70033040 


0 


100 


0 


0 


NT2RP70036290 


0 


9.965 


0 


0 


NT2RP70036470 


0 




0 


0 


NT2RP70039600 


0 


15.306 


0 


0 


NT2RP70040800 


0 


100 


0 


0 


NT2RP70042040 


0 


9.773 


0 


0 


NT2RP70042330 


0 


60.574 


0 


0 


NT2RP70042600 


0 


71 .279 


0 


0 


NT2RP70043730 


0 


7.505 


0 


0 


NT2RP70043960 


0 


100 


0 


0 


NT2RP70045410 


0 


24.679 


° 


° 


NT2RP70046560 


0 








NT2RP70046870 




100 






NT2RP70047510 


0 


100 


0 


0 


NT2RP70047660 


0 


100 


0 


0 


NT2RP70047900 


0 


18.549 


0 


0 


NT2RP70049250 


° 




° 


° 


NT2RP70049750 




^^1^00 






NT2RP70052050 


0 


64.358 


0 


0 


NT2RP70052190 


0 


100 


0 


0 


NT2RP700546B0 


0 


100 


0 


0 


NT2RP70054930 


0 


49.38 


0 


0 


NT2RP70055130 


0 


3.248 


0 


0 


NT2RP70055200 


0 


19.133 


0 


0 


NT2RP70061620 


0 


100 


0 


0 


NT2RP70061880 


0 


20.642 


0 


0 
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Table 4 (continued) 


Clone ID 


NT2RM 


1MT2RP 


NT2RI 


NT2NE 


NT2RP70062960 


° 








NT2RP70063040 




100 






NT2RP70063740 




100 






NT2RP70064080 


0 


100 


0 


0 


NT2RP70066210 


0 


100 


0 


0 


NT2RP70067010 


° 




° 


° 


NT2RP70069800 




40 229 






NT2RP70071140 




33 145 






NT2RP70071540 




49 972 






NT2RP70072210 


0 


100 


0 


0 


NT2RP70072520 


0 


100 


0 


0 


NT2RP70073590 


0 


100 


0 


0 


NT2RP70073810 


° 




° 


° 


NT2RP70074060 




100 






NT2RP70075040 




100 






NT2RP70076100 


0 


36.014 


0 


0 


NT2RP70076170 


0 


100 


0 


0 


NT2RP70076430 


0 


100 


0 


0 


NT2RP70079250 


0 


100 


0 


0 


NT2RP70079300 


0 


71.279 


0 


0 


NT2RP70081330 


0 


100 


0 


0 


NT2RP70081370 


0 


26.129 


0 


0 


NT2RP70081420 


0 


100 


0 


0 


NT2RP70081440 


0 


100 


0 


0 


NT2RP70081670 


0 


100 


0 


0 


NT2RP70083150 


0 


100 


0 


0 


NT2RP70084060 


0 


100 


0 


0 


NT2RP70084410 


0 


65.611 


0 


0 


NT2RP70084870 


0 


48.444 


0 


0 


NT2RP70085500 


0 


100 


0 


0 


NT2RP70085570 


0 


9.069 


0 


0 


NT2RP70086230 


0 


100 


0 


0 


NT2RP70087200 


0 


100 


0 


0 


NT2RP70088550 


0 


15.625 


0 


0 


NT2RP70090120 


0 


49.497 


° 


° 


NT2RP70090190 


0 








NT2RP70091490 


0 


49 38 






NT2RP70091680 


0 


100 


0 


0 


NT2RP70092150 


0 


100 


0 


0 


NT2RP70092360 


0 


100 


0 


0 


NT2RP70093630 


° 




° 


° 


NT2RP70093700 




100 






NT2RP70093940 


0 


24.073 


0 


0 


NT2RP70093970 


0 


100 


0 


0 


NT2RP70094290 


0 


100 


0 


0 


NT2RP70094660 


0 


100 


0 


0 


NT2RP70094810 


0 


58.409 


0 


0 


NT2RP70094980 


0 


44.133 


0 


0 


NT2RP70095020 


0 


100 


0 


0 


NT2RP70095070 


0 


100 


0 


0 
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Table 4 (continued) 


Clone ID 


NT2RM 


NT2RP 


NT2RI 


NT2NE 


NT0NG1 0000980 


° 








NT0NG1 00021 40 




12 777 






NTONG20002650 




10 048 






NTONG20016120 


0 


21 .562 


0 


0 


PEBLM2G0G3950 


0 


8.711 


0 


0 


PR0ST1 0005640 


0 


10.708 


0 


0 


PROST20003250 


0 


24.163 


0 


0 


SKNMC20000650 


0 


3.384 


0 


0 


SKNSH1 0000860 


O 


23.605 


0 


0 


SKNSH20003470 


0 


15.832 


0 


0 


TEST1 10000510 


0 


11.33 


0 


0 


TEST1 10000960 


0 


51.106 


0 


0 


TESTI20015110 


0 


65.524 


0 


0 


TEST1 20074640 


0 


12.147 


0 


0 


TRACH20004610 


0 


13.344 


0 


0 



Clone ID 



BEAST TBAES 



3NB691 0001 730 
FCBBF1 0007600 
KIDNE20033050 
KIDNE20060300 
NT2R 120065530 
NT2RP60000720 
NT2RP70075370 
TRACH20004200 
LIVER1 0000670 
LIVER1 0005420 
LIVER20000370 



33.793 
75.606 
40.478 
69.585 
81.65 
69.448 
57.062 
94.946 



Clone ID 



BRACE1 0001 590 

HHDPC20000950 

HSYRA20016210 

NT2R1 20074980 

3NB6920014330 

NT2RI20087490 

NT2RP60001090 

PROST1 0002200 

SKNMC20003220 

STOI\/IA20001210 



33,161 
80,827 
42.592 
58.42 
49.81 



57.778 
40.177 
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Clone ID 



BRACE20028610 
BRACE20011170 
BRACE20035940 
IMR3220013320 
NT2NE20053710 



78.541 
95.04 
60.529 
26.798 



Clone ID 



KIDNE20005740 
MAMGL1 0000320 
NESOP1 0000870 
NT2RI20056470 
NTONG20008000 



49.196 
69.766 
78.683 



Clone ID 



3NB6920002810 

ADRGL1 0000020 

BNGH420004740 

BRACE1 0000200 

BRACE1 0000420 

BRACE1 0000730 

BRACE10001590 

BRACE20005650 

BRACE20016730 

BRACE20028120 

BRACE20077980 

BRACE20083800 

BRACE20083850 

BRAWH 10001 740 

BRAWH20036930 

BRAWH20064500 

BRAWH20064930 

GTONG20028030 

FCBBF20015380 

FEBRA20005360 

FEBRA20007570 

FEBRA20008740 

FEBRA20012270 

FEBRA20025250 

HSYRA200024B0 

HSYRA20006400 

HSYRA2000B2B0 

HSYRA20015740 

HSYRA20016210 

IMR3220009350 

LIVER10001110 
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Table 9 (continued) 



Clone ID 



NT2NE20003920 

NT2NE20007630 

NT2NE20007870 

NT2RI20025410 

NT2RI20026540 

NT2RI20029580 

NT2RI20033380 

NT2RI20033830 

NT2RI20051500 

NT2RI20058110 

NT2RI20090650 

NT2RP60000720 

NT2RP70013350 

NT2RP70023790 

NT2RP70024490 

NT2RP70028750 

NT2RP70029060 

NT2RP70036800 

NT2RP70075370 

NT2RP70076100 

NT0NG1 0000980 

NT0NG1 0002460 

NTONG20015500 

OCBBF20002310 

OCBBF20013070 

PEBLM20001470 

PEBLM20003950 

PLACE60021510 

PLACE60040050 

PLACE60043970 

PROST20051430 

STOMA20001210 

STOMA20002570 

STOMA20002890 

SYNOV20011440 

TESTI1 0000230 

TESTI20009700 

TESTI20021490 

TESTI20032800 

TESTI20053960 

TESTI20080200 

TESTI20082400 

BGG1 12001 0970 

BRACE20004210 

BRACE20005250 

BRACE20011170 

BRACE20020910 

BRACE20080970 

BRAWH20000340 

BRAWH20006970 



4.336 
3.583 
6,242 
3,303 

19,111 
20.33 

10.476 
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Table 9 (continued) 



Clone ID 



BRAWH20011660 
FCBBF20001950 
FEBRA20043250 
HLUNG1 0000640 
IMR3220007420 
IMR3220014350 
KIDNEIOOOOOaO 
KIDNE10000280 
KIDNE10000500 
KIDNE10001040 
KIDNE10001430 
KIDNE10001450 
KIDNE10001520 
KIDNE20000410 
KIDNE20000510 
KIDNE20000700 
KIDNE20000850 
KIDNE20001670 
KIDNE20001920 
KIDNE20002440 
KIDNE20002450 
KIDNE20002660 
KIDNE20003150 
KIDNE20003300 
KIDNE20003490 
KIDNE20003750 
KIDNE20004030 
KIDNE20004220 
KIDNE20004970 
KIDNE20005130 
KIDNE20005170 
KIDNE20005190 
KIDNE20005740 
KIDNE20031850 
KIDNE20033050 
KIDNE20033350 
KIDNE20033570 
KIDNE20033730 
KIDNE20033770 
KIDNE20037520 
KIDNE20039410 
KIDNE20039940 
KIDNE20040340 
KIDNE20040540 
KIDNE20040840 
KIDNE20042620 
KIDNE20042940 
KIDNE20042950 
KIDNE20043440 
KIDNE20044110 



65.363 
12.177 
23.921 
2.375 
4.092 
22.13 



5.561 
35.77 
12.49 
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Table 9 (continued) 



Clone ID 



KIDNE20045200 

KIDNE20045340 

KIDNE20045790 

KIDNE20046810 

KIDNE2004a2aO 

KIDNE20048640 

KIDNE20048790 

KIDNE20049810 

KIDNE20050420 

KIDNE20052960 

KIDNE20053360 

KIDNE20054000 

KIDNE20054770 

KIDNE20056290 

KIDNE20056760 

KIDNE20059080 

KIDNE20059370 

KIDNE20060140 

KIDNE200603Q0 

KIDNE20060530 

KIDNE20060620 

KIDNE20061490 

KIDNE20062990 

KIDNE20063530 

KIDNE20063760 

KIDNE20066520 

KIDNE20067600 

KIDNE20067750 

KIDNE20068800 

KIDNE20070050 

KIDNE20070770 

KIDNE20071860 

KIDNE20073280 

KIDNE20073520 

KIDNE20073560 

KIDNE20074220 

KIDNE20075690 

KIDNE2007B100 

KIDNE20078110 

LIVER1 0000790 

MAMGL1 0000320 

NB9N41 0000470 

NT2NE20053710 

NT2RI20006710 

NT2RI20013420 

NT2RI20016570 

NT2RI20018460 

NT2RI20025540 

NT2RI20040590 

NT2RI20065530 



24.137 
66.711 
100 
39.822 
4.537 
8.83 



3.598 
1.127 
2.454 
2.015 
23.435 
20.967 
5.573 
13.676 
3.41 



129 



EP 1 293 569 A2 

Table 9 (continued) 



Clone ID 

"NT2Rr20087490"' 
NT2RI20087910 
NT2RP60000350 
NT2RP60001230 
NT2RP70043730 
NT2RP70069860 
NT2RP70074220 
OCBBF20014940 
PLACE60020840 
PLACE60043120 
PR0ST1 0003430 
SKNSH20001510 
SMINT1 0000160 
SPLEN20000 
SPLEN20001340 
SPLEN20003570 
STOMA1 0000470 
TESTI1 0000700 
TESTI20027070 
TESTI20040310 
TRACH1 0000300 
TRACH20000790 
TRACH20002500 
TRACH20007800 



""17408 O' 

2.824 0 

5.311 0 

5.09 0 

14.846 0 

10.745 26.431 

3.96 0 

49.164 0 

2.658 0 

3.992 9.82 

25.547 0 

20.208 0 

38.817 15.914 

66.711 0 

88.909 0 

31.635 0 

17.849 0 

25.214 0 

14.795 0 

2.58 0 

11.119 0 

4.534 11.153 

35.282 0 

5.605 0 



Clone ID 



FCBBF50002610 
FEBRA20076220 
KIDNE20033050 
NT2NE20003840 
KIDNE20062480 
KIDNE20068800 
LIVER1 0000580 
LIVER1 0000670 
LIVER1 0000790 
LIVER1 0000990 
LIVER1 0001 040 
LIVER10001110 
LIVER1 0001 750 
LIVER1 0002300 
LIVER1 0002780 
LIVER1 0003030 
LIVER1 0004330 
LIVER1 0005420 
LIVER20000330 
LIVER20004160 
LIVER20004460 



7.391 
60.015 
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Table 10 (continued) 



Clone ID 

" LIVEFr200051 50" " 
NT2NE20002140 
NT2RI20030510 
NT2R1 20043040 
NT2RI20090650 
PR0ST1 0005640 
PROST20032320 
SALGL1 0001 570 
SMINT10000160 
SPLEN20002420 
TESTI20002530 
TESTI20080200 
THYMU1 0003590 
TRACH20004720 



100 o' 

27.596 0 

18.459 0 

50.8 0 

7.296 0 

26.335 0 

15.404 0 

2.428 0 

32.172 0 

83.286 0 

26.418 0 

3.383 0 

7.302 0 

6.356 0 



Clone ID 



NT2RI20030110 

BNGH41 0001 980 

BRACE1 0000420 

BRACE1 0001 150 

BRACE20014770 

BRACE20018550 

BRAWH20006970 

BRAWH20014610 

FEBRA20008810 

FEBRA20015840 

FEBRA20044120 

HHDPG20001490 

HLUNG1 0000240 

HLUNG1 0000300 

HLUNG1 0000370 

HLUNG1 0000640 

HLUNG1 0000760 

HLUNG1 0000990 

HLUNG1 0001 050 

HLUNG10001100 

HLUNG20000680 

HLUNG20001160 

HLUNG20001250 

HLUNG20001420 

HLUNG20001760 

HLUNG20002550 

HLUNG20003140 

HLUNG20004120 

HLUNG20004800 

HLUNG20005010 

HSYRA20014 



1.339 
28.126 

25.65 

8,521 
7.03 
19.713 
53.019 

15.75 
25.611 



14.018 
42.131 



5.302 
12.578 
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Table 11 (continued) 



Clone ID 

"KlbNE20062660" ' 
KIDNE20033050 
NT2NE20014350 
NT2RI20016570 
NT2R 120026540 
NT2RI20051500 
NT2RI20064120 
NT2RI20083960 
NT2RI20085260 
NT2RI20087490 
NT2RP70009060 
NT2RP70011660 
NT2RP70029060 
NT2RP70055020 
NT2RP70074220 
NT2RP70076100 
NT0NG1 0002460 
NTONG20008000 
PLACE60043120 
SKMUS20016340 
SKNIVIC20005930 
SMINT20000180 
SMINT20002390 
SIVIINT20002770 
SIVIINT20003960 
ST0MA1 0000470 
STOMA20001880 
SYN0V2001374 
TESTI20036250 
TESTI20080200 
TRACH20004610 



"15 "061 O' 

3.637 0 

28.99 0 

9.167 0 

8.156 0 

21.652 0 

9.093 0 

17.851 0 

5.474 0 

3.03 0 

18.337 0 

5.822 0 

6.519 0 

10.451 0 

8.521 0 

25.546 0 

16.426 0 

7.189 0 

8.589 0 

15.317 0 

13.727 0 

38.989 0 

51.283 0 

12.776 0 

10.489 0 

3B402 0 

52.43 0 

23.798 0 

32.684 0 

2.927 0 

28.395 0 



Clone ID 



NOVAR TOVAR 



BRACE20011880 

TESTI2003071 0 

BRACE20076210 

NT2RI2005368Q 

SKMUS20008630 

TESTI20005910 

TESTI20040310 



97.13 
78,467 
61 ,727 
94,963 
59.442 



Table 13 

Clone ID | STOMA I TSTOM 

HSYRA2001103Q 0 60,206 

NT2RI20013420 0 48.779 

NT2RP70079750 0 74.336 
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Table 13 (continued) 



Clone ID 
BRACE20003320 
HEART20005060 
HHDPC20000950 
HLUNG20Q04120 
HLUNG20Q05010 
HSYRA20006400 
KIDNE1 0000500 
KIDNE20062480 
NT2NE20053710 
NT2NE20054410 
NT2RI20015400 
NT2RI20016570 
NT2RI20064120 
NT2RI20070840 
NT2RI20071330 
NT2RI20074980 
NT2RI20077230 
NT2RI20089420 
NT2RP70000760 
NT2RP70028750 
PLACE60014430 
PLACE60024190 
SKNMC20000970 
ST0MA1 0000470 
ST0MA1 0000520 
ST0MA1 00011 70 
ST0MA1 0001 330 
ST0MA1 0001 860 
STOMA20000320 
STOMA20000880 
STOMA20001210 
STOMA20001880 
STOMA20002570 
STOMA20002890 
STOMA20003960 
STOMA20004780 
STOMA20004820 
THYMU 10003590 



28.838 O" 

8.996 0 

3.367 0 

38.225 0 

4.81 0 

23.013 0 

30.974 0 

2.901 0 

2.2 0 

65.251 0 

0.544 0 

4.159 0 

8.25 0 

15.758 0 

13.128 0 

3.862 0 

20.833 0 

17.218 0 

4.442 0 

3.021 0 

9.182 0 

54.046 0 

4.463 0 

34.842 0 

100 0 

100 0 

100 0 

100 0 

100 0 

100 0 

4.129 0 

47.57 0 

23.78 0 

4.851 0 

100 0 

100 0 

28.859 0 

5.733 0 



Clone ID 



NT2R1 20085260 
3NB6920002810 
BRACE1 0000420 
BRACE20089990 
BRACE20092120 
BRAWH1 0001 680 
BRAWH20011410 



28.795 
61.611 
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Table 14 (continued) 



Clone ID 

" B R AW H2 0 0 f 1 66 0 " 
FCBBF20005910 
FCBBF50002610 
FEBRA20005360 
FEBRA20006800 
FEBRA20008800 
FEBRA20044120 
FEBRA20057520 
HEART20005060 
HHDPC20000950 
HLUNG1 0000760 
HLUNG20003140 
HSYRA20014200 
HSYRA20014760 
HSYRA20015800 
IMR321 0002420 
IMR3220002230 
IMR3220009350 
IIVIR3220014350 
IMR3220016000 
KIDNE20000850 
KIDNE20060140 
KIDNE20060300 
MAMGL1 0000350 
NT2NE20035690 
NT2NE20053710 
NT2R1 10000270 
NT2R1 20000640 
NT2R1 20002940 
NT2R1 2001 0910 
NT2R1 2001 3420 
NT2RI20016570 
NT2RI20033380 
NT2RI20036950 
NT2RI20037510 
NT2RI20053350 
NT2R 120057230 
NT2RI20058110 
NT2RI20071480 
NT2RI20074980 
NT2RI20084810 
NT2RI20087490 
NT2RI20087910 
NT2RP60000350 
NT2RP70032030 
NT2RP70043730 
NTONG1 0000980 
NTONG1 0002460 
PLACE60014430 
PLACE60026680 



"37222 0 

17.567 0 

4.011 0 

7.069 0 

81.993 0 

29.932 0 

14.823 0 

5.036 0 

1 .885 0 

3.26 0 

7.12 0 

6.388 0 

5.742 0 

36.126 0 

7.465 0 

3.637 0 

4.412 0 

4.472 0 

0.788 0 

5.232 0 

15.135 0 

3.176 0 

1.122 0 

11.316 0 

1 .232 0 

61 ,432 0 

3.308 0 

19.984 0 

15.542 0 

2.201 0 

2.328 0 

10.637 0 

5.07 0 

2.677 0 

2.517 0 

2.613 0 

6.719 0 

25.67 0 

4.324 0 

20.5 0 

1 .539 0 

12.345 0 

1.451 0 

4.778 0 

8.111 0 

7.275 0 

4.171 0 

10.28 0 

15.19 0 
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Table 14 (continued) 



Clone ID 

"PLACE6004396d" 
PLACE60044910 
PLACE60047380 
PR0ST1 0002200 
PR0ST1 0005260 
PROST20025910 
PROST20033380 
PUAEN1 0000570 
SALGL1 0001 570 
SKMUS1 00001 40 
SKMUS20003430 
SKMUS20009540 
SKNMC1 0002510 
SKNMC20000970 
SKNSH1 0000860 
SMINT20002770 
STOMA20002890 
SYNOV20011440 
TEST1 10000230 
TESTI20018290 
TESTI20021490 
TESTI20080200 
TESTI20082400 
TRACK 10000300 
TRACH20002370 
TRACH20007800 
TRACH20012890 
UTERU 10000770 
UTERU 10000960 
UTERU10001600 
UTERU1 0001 920 
UTERU20000470 
UTERU20003380 
UTERU2O0039 
UTERU20004850 
UTERU20005410 
UTERU20005690 



""10.64 O' 

52.136 0 

52.136 0 

1.976 0 

12.466 0 

51.788 0 

16.15 0 

14.201 0 

1.067 0 

23.507 0 

35.091 0 

9.414 0 

6.618 0 

2.498 0 

25.511 0 

3.244 0 

2.716 0 

4.393 0 

8.707 0 

12.741 0 

14.455 0 

2.973 0 

2.555 0 

12.149 0 

20.883 0 

12,248 0 

4.662 0 

100 0 

50.58 0 

100 0 

100 0 

100 0 

35.158 0 

100 0 

100 0 

33.583 0 

50.58 0 



Clone ID 



3NB6910001160 
3NB6910001290 
3NB691 0001 730 
BNGH420004740 
BRACE20008850 
BRACE20020910 
BRACE20074010 
BRAWH20014840 
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Table 15 (continued) 



NTONG CTONG 



Clone ID 

"BRAWH20689560' 
CTONG20003030 
CTONG20005890 
CTONG20007710 
CTONG2000a270 
GTONG20011390 
CTONG20013200 
CTONG20013660 
CTONG20015330 
CTONG20018200 
CTONG20019110 
CTONG20019550 
GTONG20020730 
CTONG20021430 
CTONG200241 80 
CTONG20024530 
CTONG20025580 
CTONG20027210 
CTONG20028030 
CTONG200281 60 
CTONG20028200 
CTONG20029650 
GTONG20037820 
CTONG200471 60 
CTONG20055530 
GTONG20064490 
FEBRA20003770 
FEBRA20004520 
FEBRA20007400 
FEBRA20007570 
FEBRA20012940 
FEBRA20021940 
FEBRA20044120 
HCASM 10001 150 
HHDPC20004560 
HLUNG20003140 
HSYRA20002480 
IMR3220009350 
IMR3220012180 
KIDNE20000850 
KIDNE20002660 
KIDNE20004220 
KIDNE20005740 
KIDNE20056760 
KIDNE20060140 
KIDNE20062480 
MESAN20000920 
MESAN20003370 
NHNPC20002060 
NT2NE1 0001 850 



0 2T.778 

0 100 

0 39.66 

0 100 

0 18.957 

0 100 

0 19.93 

0 100 

0 100 

0 100 

0 48.152 

0 100 

0 25.035 

0 100 

0 100 

0 23.734 

0 57.263 

0 100 

0 8.603 

0 100 

0 55.786 

0 100 

0 100 

0 100 

0 38.023 

0 24.327 

0 22.646 

0 19.228 

0 4.377 

0 1 .607 

0 7.503 

0 1.128 

0 4.895 

0 1 ,394 

0 18.986 

0 4.356 

0 3.688 

0 2.7 

0 3.684 

0 1.601 

0 9.361 

0 23.916 

0 1 .538 

0 21.746 

0 9.261 

0 2.981 

0 19.727 

0 17.82 

0 4.429 

0 39.142 
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Table 15 (continued) 



Clone ID 

"NT2"NE2b00056"d" " 
NT2NE20002140 
NT2NE20003270 
NT2NE20003840 
NT2NE20014350 
NT2NE20053710 
NT2RI20006690 
NT2RI20006710 
NT2RI20016570 
NT2RI20018660 
NT2R1 20025300 
NT2R1 20025410 
NT2R1 200301 90 
NT2R1 20030510 
NT2RI20036950 
NT2RI20046060 
NT2RI20053350 
NT2RI20067350 
NT2R 120075720 
NT2RI20078790 
NT2RI20083960 
NT2RI20087140 
NT2RI20094060 
NT2RP60000350 
NT2RP60001230 
NT2RP70000760 
NT2RP700 04770 
NT2RP70009060 
NT2RP70011660 
NT2RP70023760 
NT2RP70023790 
NT2RP70024500 
NT2RP70026190 
NT2RP70029820 
NT2RP70036470 
NT2RP70043730 
NT2RP70061880 
NT2RP70071770 
NT2RP70076100 
NT2RP70079750 
NT2RP70084870 
NT2RP70093730 
OCBBF20013070 
PEBLM20003950 
PLACE60037460 
PLACE60043120 
PROST10003 
PROST1 0005260 
PROST2003232 
PROST2003302 
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Table 15 (continued) 



Clone ID 

"PROST20056040 " 
SKNMC1 0002510 
SKNMC20000650 
SKNMC20010570 
SKNSH20003470 
SMINT20000180 
SYNOV20013740 
TESTI1 0000230 
TESTI1 0001 680 
TESTI20007840 
TESTI20021490 
TESTI20022230 
TESTI20023690 
TESTI20030050 
TESTI20042950 
TESTI20068720 
TESTI20080200 
TRACH20012890 
BRACE20006980 
BRACE20092740 
BRAWH20006970 
FCBBF1 0007600 
FEBRA20062700 
IMR3220016000 
KIDNE20073280 
MAMGLI 0000350 
NT2NE20035690 
NT2RI20056470 
NT2RI20058110 
NT2RI20084810 
NT2R 120085260 
NT2RP70015910 
NT2RP70036290 
NT2RP70036320 
NT2RP70074220 
NT2RP70075370 
NT0NG1 0000330 
NT0NG1 0000520 
NT0NG1 0001 230 
NT0NG1 0001 300 
NT0NG1 0001 820 
NT0NG1 00021 40 
NT0NG1 0002460 
NT0NG1 0002570 
NT0NG1 0002640 
NTONG20002650 
NTONG20003340 
NTONG20003630 
NTONG20004920 
NTONG20005830 



55.471 
22.273 
8.372 
6.676 
19.42 
3.049 
9.592 
2,171 
21,893 
23,487 
12.999 
13.22 
5.378 
46.836 
20.836 
18.528 
8.372 
5.038 



39.112 
80.147 



67.153 
21.01 
27.701 
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Table 15 (continued) 



Clone ID 

"NT6NG20008000" 
NTONG20008780 
NTONG20009660 
NTONG20009850 
NTONG20011370 
NTONG20012220 
NTONG20014280 
NTONG20015500 
NTONG20016120 
OCBBF20011240 
OCBBF20015860 
PR0ST1 0002200 
SKMUS20016340 
SKNMC20000970 
STOMA20004820 
SYNOV10001280 
SYNOV20011440 
THYMU 10000830 
TRACH20000790 
TRACH20009260 



100 
76.302 
52.537 
45.084 
24.128 
50.483 
22.942 
15.05 
4.833 
31.253 
13.986 
8.499 
27.581 
19.169 
30.323 
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Table 16 
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Table 19 
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Table 29 
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Table 30 
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Table 32 
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Table 34 
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Table 36 
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Clone ID 



KIDNE20062480 

NT2R1 20033040 

NT2RP60000350 

BGGI120010970 

BRACE1 0000420 

BRACE1 0001 150 

BRACE20003320 

BRACE20077980 

BRAWH 10000370 

BRAWH2000034G 

BRAWH20011660 

BRAWH20014840 

FEBRA20008740 

FEBRA20072800 

HEART20000350 

HEART20000990 

HEART20003090 

HEART20004110 

HEART20004480 

HEART20004920 

HEART20005060 

HEART20005200 

HEART20005680 

HHDPC20001150 

HLUNG20005010 

HSYRA20014200 

IIVIR3220013170 

KIDNE20004970 

NT2RI20000640 

NT2RI20006710 

NT2RI20015400 

NT2RI20026540 

NT2RI20037510 

NT2R1 20057230 

NT2R1 200641 20 

NT2RI20071330 

NT2RI20071480 

NT2RI20077540 

NT2RI20084810 

NT2RI20087910 

NT2RP70000760 

NT2RP70024500 

NT2RP70029060 

NTONG1 0001 820 

PLACE60012810 

PLACE60043120 

PROST20000530 

SKMUS1 0000640 

SKMUS20004580 



36.242 
40.301 
32.381 
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Table 37 (continued) 



Clone ID 
SKMUS20015010 
SMINT20002770 
TESTI20033250 
TESTI20074640 
UMVEN20001330 



"O 18.378 

0 6.315 

0 58.713 

0 25.552 

0 6.282 



Clone ID 
3NB6920003300 
3NB692000gi20 
BGG1 12001 0970 
BRACE20004210 
BRACE20005250 
BRACE20011170 
BRACE20020910 
BRACE20026850 
BRACE20080970 
BRAWH20000340 
BRAWH20006970 
BRAWH20011660 
FCBBF20001950 
FEBRA20021940 
FEBRA20043250 
HLUNG1 0000640 
IMR3220007420 
IMR3220014350 
KIDNE1 0000280 
KIDNE1 0000500 
KIDNE1 0001 040 
KIDNE1 0001 430 
KIDNE1 0001 450 
KIDNE10001520 
KIDNE20000410 
KIDNE20000510 
KIDNE20000700 
KIDNE20000850 
KIDNE20001670 
KIDNE20001920 
KIDNE20002440 
KIDNE20002450 
KIDNE20002660 
KIDNE20003150 
KIDNE20003300 
KIDNE200034gO 
KIDNE20003750 
KIDNE20004030 
KIDNE20004220 
KIDNE20004970 



0 2.071 

0 2.884 

0 4.336 

0 3.583 

0 6.242 

0 3.303 

0 19.111 

0 34.356 

0 20.33 

0 10.476 

0 3.96 

0 5.897 

0 65.363 

0 1 .687 

0 12.177 

0 23.921 

0 2.375 

0 4.092 

0 100 

0 15.868 

0 100 

0 100 

0 19.052 

0 100 

0 100 

0 100 

0 100 

0 4.788 

0 100 

0 100 

0 37.565 

0 100 

0 7 

0 100 

0 20.536 

0 64.026 

0 100 

0 5.561 

0 35.77 

0 12.49 
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Table 38 (continued) 



Clone ID 



KIDNE20005130 
KIDNE20005170 
KIDNE20005190 
KIDNE20005740 
KIDNE20031850 
KIDNE20033050 
KIDNE20033350 
KIDNE20033570 
KIDNE20033730 
KIDNE20033770 
KIDNE20037520 
KIDNE20039410 
KIDNE20039940 
KIDNE20040340 
KIDNE20040540 
KIDNE20040840 
KIDNE20042620 
KIDNE20042940 
KIDNE20042950 
KIDNE20043440 
KIDNE20045200 
KIDNE20045340 
KIDNE20045790 
KIDNE20046810 
KIDNE20048280 
KIDNE20048640 
KIDNE20048790 
KIDNE20049810 
KIDNE20050420 
KIDNE20052960 
KIDNE20053360 
KIDNE20054000 
KIDNE20054770 
KIDNE20056290 
KIDNE20056760 
KIDNE20059080 
KIDNE20059370 
KIDNE20060140 
KIDNE20060300 
KIDNE20060530 
KIDNE20060620 
KIDNE20061490 
KIDNE20062480 
KIDNE20062990 
KIDNE20063530 
KIDNE20063760 
KIDNE20066520 
KIDNE20067600 
KIDNE20067750 
KIDNE20068800 
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Table 38 (continued) 



Clone ID 
KIDNE20070050 
KIDNE20070770 
KIDNE20071860 
KIDNE20073280 
KIDNE20073520 
KIDNE20073560 
KIDNE20074220 
KIDNE20075690 
KIDNE20078100 
KIDNE20078110 
LIVER1 0000790 
MAMGL1 0000320 
NB9N41 0000470 
NT2NE20053710 
NT2RI20006710 
NT2RI20013420 
NT2RI20016570 
NT2RI20018460 
NT2RI20025540 
NT2RI20040590 
NT2RI20065530 
NT2R1 20087490 
NT2RI20087910 
NT2RP60000350 
NT2RP60001230 
NT2RP70043730 
NT2RP70069860 
NT2RP70074220 
OCBBF20014940 
PLACE60014430 
PLACE60020840 
PLACE60043120 
PR0ST1 0003430 
SKNMC20000970 
SKNSH20001510 
SMINT1 00001 60 
SMINT20003960 
SPLEN20000470 
SPLEN20001340 
SPLEN20003570 
STOMA1 0000470 
SYNOV1 0001 280 
TEST! 10000700 
TESTI20027070 
TESTI20040310 
TRACH 10000300 
TRACH20000790 
TRACH20002500 
TRACH20007800 
KIDNE1 0000080 



"O 66.711 

0 100 

0 39.822 

0 4.537 

0 8.83 

0 100 

0 100 

0 100 

0 100 

0 100 

0 15.673 

0 1.138 

0 3.598 

0 1.127 

0 2.454 

0 2.015 

0 23.435 

0 20.967 

0 5.573 

0 13.676 

0 3.41 

0 1 .408 

0 2.824 

0 5.311 

0 5.09 

0 14.846 

0 10.745 

0 3.96 

0 49.164 

0 4.704 

0 2.658 

0 3.992 

0 25.547 

0 1.143 

0 20.208 

0 38.817 

0 1 .625 

0 66.711 

0 88.909 

0 31 .635 

0 17.849 

0 6.616 

0 25.214 

0 14.795 

0 2.58 

0 11.119 

0 4.534 

0 35.282 

0 5.605 
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Table 38 (continued) 



Clone ID 



KIDNE20044110 
NT2RI20033040 
NT2RI20037510 
NT2RP70065270 
TRACH20012890 



92. 49 
40.707 
60.346 
40.543 
52.552 



Clone ID 



BNGH41 0001 980 

BRACE1 0000420 

BRACE1 0001 150 

BRACE20014770 

BRACE20018550 

BRAWH20006970 

BRAWH20014610 

FEBRA20008B10 

FEBRA20015B40 

FEBRA20044120 

HHDPC20001490 

HLUNG1 0000240 

HLUNG1 0000300 

HLUNG1 0000370 

HLUNG1 0000640 

HLUNG1 0000760 

HLUNG1 0000990 

HLUNG1 0001 050 

HLUNG10001100 

HLUNG20000680 

HLUNG20001160 

HLUNG20001250 

HLUNG20001420 

HLUNG20001760 

HLUNG20002550 

HLUNG20003140 

HLUNG20004120 

HLUNG20004800 

HLUNG20005010 

HSYRA20014200 

KIDNE20002660 

KIDNE20033050 

NT2NE20014350 

NT2R1 2001 6570 

NT2R1 20026540 

NT2RI20051500 

NT2R 120064 120 

NT2RI20083960 

NT2R 120085260 

NT2R 120087490 
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Table 39 (continued) 



Clone ID 
NT2RP70009060 
NT2RP70011660 
NT2RP70029060 
NT2RP70055020 
NT2RP70074220 
NT2RP70076100 
NTONG1 0002460 
NTONG20008000 
PLACE60043120 
SKMUS20016340 
SKNMC20005930 
SMINT20000180 
SIVIINT20002390 
SMINT20002770 
SMINT20003960 
ST0MA1 0000470 
STOMA20001880 
SYNOV20013740 
TESTI20036250 
TESTI20080200 
TRACH20004610 
BRACE20004210 
IMR3220007420 



18.337 
5.822 
6.519 

10.451 
8.521 

25.546 

16.426 

8.589 
15.317 
13.727 
38.989 
51.283 
12.776 

38.402 
52.43 

23.798 

32.684 
2.927 

28.395 



Alteration of the expression level of each clone due to TNF-a or LPS stimulation to human monocyte cell line 


THP-1 and alteration of the expression level of each clone due to co-culture of gastric cancer cell line MKN45 with 


Helicobacter pylori, ctl, TNF and LPS i 


n the column of THP-1 , respectively, indicate the relative expression levels 


in unstimulated THP-1, in the cell stimulated with 10 ng/mL TNF-a for 3 hours, and in theceli stimulated with 1 \ig/ 


mL LPS for 3 hours; ctl and H. pylori in the column of MKN45 indicate the relative expression levels in MKN45 


cultured without Helicobacter pylori and in MKN45 co-cultured with Helicobacter pylori (aX a rat 


0 of IWIKN45: Heli- 


cobacter pylon = ^■.^00 cells (colonies) forS hours, respectively 






[ATAC-PCR] 












Clone name 


THP-1 


MKN45 




ctl 


TNF 


LPS 


ctl 


H. pylori 


3NB6920000290 


2.0 


1.9 


0.4 


0.1 


0.0 


ADRGL1 00001 80 


2.2 


5.1 


2.0 


3.3 


5.7 


BNGH41 0001 370 


0.8 


1.4 


0.3 


0.4 


0.5 


BRACE1 0001 590 


1.5 


2.3 


1.6 


0.4 


0.8 


BRAGE1 0001 690 


2.3 


3.6 


2.9 


2.2 


1.8 


BRAGE20010650 


2.1 


2.2 


2.1 


2.2 


2.1 


BRACE20013400 


2.6 


0.8 


0.2 


1.5 


1.1 


BRACE20030780 


0.3 


1.3 


0.0 


2.2 


1.7 


BRACE20034490 


2.0 


1.6 


0.6 


2.5 


0.3 


BRACE20077640 


0.4 


1.0 


0.1 


1.7 


0.3 


BRACE20079530 


0.6 


0.1 


0.1 


0.0 


0.2 


BRAGE20083850 


0.9 


2.5 


1.3 


1.1 


0.0 


BRAGE20091880 


1.5 


0.5 


0.1 


0.5 


0.0 
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Clone name 


THP-1 


MKN45 


ctl 


TNF 


LPS 


Ctl 


H. pylori 


BRAWH1 0001 620 


1.8 


1.0 


0.3 


1.1 


3.1 


BRAWH20004430 


0.2 


1.4 


0.2 


0.5 


0.5 


FCBBF1 00061 80 


0.1 


3.6 


1.3 


2.1 


0.7 


FEBRA20003780 


1.5 


3.0 


2.8 


1.7 


1.3 


FEBRA20006800 


0.7 


2.4 


0.9 


0.0 


1.6 


FEBRA20008810 


2.3 


1.4 


0.9 


2.3 


1.3 


FEBRA20012940 


0.4 


1.0 


0.2 


0.7 


0.6 


FEBRA20015840 


0.1 


3.3 


2.6 


0.1 


0.0 


HCASM 10000610 


1.8 


2.0 


2.1 


2.3 


2.2 


HEART20000350 


2.0 


3.3 


1.8 


2.3 


0.5 


HEART200044B0 


0.0 


0.3 


0.0 


3.5 


3.0 


HEART20005060 


1.2 


0.6 


0.0 


4.8 


4.5 


HHDPC20000950 


0.4 


0.2 


0.1 


1.4 


0.7 


HLUNG1 0000370 


0.0 


1.3 


0.2 


2.6 


0.7 


HLUNG20001160 


0.6 


3.7 


1.7 


0.1 


0.0 


HLUNG20001760 


1.4 


0.5 


0.0 


0.3 


0.0 


HSYRA20003470 


1.1 


1.5 


0.6 


1.1 


0.3 


HSYRA20013320 


0.1 


1.7 


0.7 


1.4 


0.7 


IMR32 1000 1580 


0.4 


0.0 


0.0 


0.3 


0.2 


IMR32 10002660 


0.8 


0.4 


0.2 


0.5 


0.4 


IMR3220008380 


0,4 


0.9 


0.4 


1.1 


0.5 


IMR3220008590 


2,0 


0.3 


0.9 


2.1 


3.4 


KIDNE10001520 


0,4 


1.2 


0.7 


3.4 


2.2 


KIDNE20000850 


0.7 


0.7 


0.4 


1.1 


0.5 


KIDNE20003490 


0.9 


1.7 


0.9 


0.3 


1.9 


KIDNE20005170 


0.9 


0.7 


0.3 


6.4 


0.2 


KIDNE20033730 


1.2 


1.6 


1.8 


0.5 


0.5 


KIDNE20040540 


0.1 


2.5 


0.8 


0.1 


0.2 


KIDNE20050420 


0.7 


0.5 


0.7 


0.2 


0.3 


KIDNE20061490 


0,1 


1.2 


0.6 


0.2 


0.1 


KIDNE20062990 


0,7 


3.5 


0.7 


0.0 


0.0 


LIVER20000330 


6.0 


7.4 


1.1 


7.3 


0.6 


NT2NE1 0001 630 


0.1 


2.6 


1.4 


2.5 


2.7 


NT2NE1 0001 850 


1.1 


0.3 


0.1 


0.4 


1.1 


NT2NE20003920 


0.6 


3.3 


1.6 


0.5 


0.2 


NT2NE20005500 


0.6 


2.6 


2.2 


0.1 


0.2 


NT2RI20009740 


1.2 


1.9 


0.9 


0.1 


0.0 


NT2RI20014500 


0.2 


3.5 


1.7 


0.0 


0.0 


NT2RI20016570 


1.2 


3.6 


3.1 


0.0 


0.1 


NT2RI20018660 












NT2RI20021520 


1.1 


1.5 


1.1 


1.1 


0.5 


NT2RI20050870 


1.0 


0.6 


0.3 


0.7 


0.8 


NT2RI20053350 


2.2 


1.0 


0.9 


1.5 


0.6 


NT2RI20070480 


1.5 


0.3 


0.0 


0.0 


0.0 


NT2RI20073030 


0.4 


0.6 


0.4 


2.0 


0.8 


NT2RI20078270 


0.2 


2.6 


0.5 


1.4 


0.3 


NT2RI20078790 


1.7 


2.4 


0.7 


2.2 


1.6 


NT2R1 20083360 


0.1 


0.4 


0.2 


0.1 


0.3 
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Clone name 


THP-1 


MKN45 


ctl 


TNF 


LPS 


ctl 


H. pylori 


NT2RP60000080 


2.2 


2.5 


1.5 


2.0 


1.1 


NT2RP60000390 


1.6 


2.3 


1.2 


2.3 


0.4 


NT2RP60000590 


1.6 


2.0 


1.3 


2.5 


1.3 


NT0NG1 0000980 


0.9 


0.8 


0.8 


1.9 


0.0 


NT0NG1 0002570 


0.1 


5.4 


0.5 


0.0 


0.0 


PLACE60020160 


0.8 


1.2 


0.6 


0.0 


0.0 


PLACE60026990 


0.5 


0.2 


0.1 


3.2 


2.1 


PLACE60047380 


1.6 


0.7 


0.9 


1.8 


2.2 


PUAEN 10003220 


0.1 


1.9 


1.2 


0.0 


0.1 


SKNMC1 0000290 


0.4 


1.2 


0.5 


0.9 


0.2 


SKNMC1 0001 590 


1.7 


1.9 


0.8 


1.0 


0.4 


SKNMC20000650 


1.1 


1.1 


0.8 


0.1 


0.1 


STOIVIA20002570 


0.3 


3.1 


1.5 


0.6 


0.6 


STOIVIA20002890 


1.8 


0.8 


0.4 


0.1 


0.2 


SYNOV20001 770 


1.7 


0.5 


0.5 


1.7 


4.2 


TEST1 10000230 


2.7 


4.6 


3.1 


2.3 


1.7 


TEST1 10000550 


0.4 


0.1 


0.3 


0.1 


2.8 


TESTI20011340 


0.3 


2.3 


2.2 


2.2 


1.9 


THYMU 10005580 


1.1 


2.1 


1.1 


1.5 


1.1 


TRACH1 0000630 


0.3 


0.5 


0.2 


2.7 


2.6 


TRACH20001960 


9,4 


2.7 


1.3 


1.0 


0.4 


UMVEN10001220 


1,9 


2.8 


1.0 


0.8 


0.2 


UMVEN20001330 


2,2 


2.2 


0.0 


1.0 


0.1 


UTERU20004850 


1.7 


6.3 


2.3 


0.0 


0.3 



Homology Search Result Data 



35 [0295] Data obtained by the homology search for full-length nucleotide sequences and deduced amino acid sequenc- 
es. 

[0296] in the result of the search shown below, both units, aa and bp, are used as length units for the sequences to 

be compared, 

[0297] Each data includes Clone name, Definition in hit data, P value, Length of sequence to be compared, Homology, 
40 and Accession number (No.) of hit data. These items are shown in this order and separated by a double-slash marl<, //, 
3NB6910000180 
3NB691 0000850 

3NB6910001160//STEROIDOGENIC ACUTE REGULATORY PROTEIN PRECURSOR,//9.70E-08//160aa//21%// 
Q28996 
45 3NB691 0001 290 
3NB691 0001 730 
3NB6920000290 

3NB6920002810//PUTATiVE ATP-DEPENDENT RNA HELICASE T26G10.1 IN CHROMOSOME III.//4.70E-154// 
442aa//64%//P34580 
50 3NB6920003300//YIP1 PROTEiN.//1 .80E-35//1 81 aa//41 %//P53039 
3NB6920005450 
3NB6920009120 

3NB6920010020//REGULATOR OF G-PROTEIN SIGNALING 3 (RGS3) (RGP3).//1 .60E-89//1 79aa//95%//P49796 
3NB6920010220//putative C3HC4-type RING zinc finger protein//3.70E-38//374aa//29%//AAG27460 
55 3NB6920013490 
3NB6920014330 

3NB6920014710//Homo sapiens hepatocellular carcinoma-associated antigen 58 (HCA58) mRNA, complete cds.// 
5.40E-1 30//236aa//1 00%//AF22041 6 
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3NB6920015110//CARG-BINDING FACTOR-A (CBF-A).//7.50E-140//290aa//90%//Q99020 
3NB6920015280//LIGHT-MEDIATED DEVELOPMENT PROTEIN DET1 y/4.60E-55//263aa//35%//P48732 
3NB6920015570//ZINC FINGER PROTEIN 135.//7.90E-129//365aa//60%//P52742 

3NB6920016370 
5 3NB6920017190 

ADRGL10000020//Homo sapiens Kelch-IIke 1 protein (KLHL1) mRNA, complete cds.//2.90E-298//546aa//100%// 

AF252283 
ADRGL1 00001 80 

ADRGL10000650//ZING FINGER PROTEIN 135.//2.20E-76//205aa//64%//P52742 
10 ADRGL10001600//CYTOCHROME P450 XXIB (EC 1.14.99.10) (STEROID 21 -HYDROXYLASE) (P450-C21 B).//2. 
50E-248//397aa//98%//P0B6B6 

ADRGL10001650//IMIDAZOLONEPROPIONASE (EC 3.5.2.7) (IMIDAZOLONE-5-PROPIONATE HYDROLASE).// 
6.1 0E-67//41 8aa//37%//P42084 
ADRGL1 0001 820 

15 A0R6L20000740//RHO-GTPASE-ACTIVATING PROTEIN 6 (RHO-TYPE GTPASE-ACTIVATING PROTEIN RHOG- 
APX-i ),//1 .50E-67//327aa//43%//0431 82 
ADRGL20003230 
ADRGL20004280 

ASTR01 00001 80//DYNEIN INTERMEDIATE CHAIN 3, CILIARY.//1 .10E-32//207aa//33%//016960 
20 ASTR020000950 

ASTR020004170//Homo sapiens sorting nexin 5 (SNX5) mRNA, complete cds.//5.20E-47//98aa//100%//AF121855 

ASTR020004800 

BGG1 11 0002850 

BGG1 120001 61 0//CELL DIVISION CONTROL PROTEIN 1.//3.10E-14//218aa//28%//P40986 
25 BGGI120005330//INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE 2 (EC 1 .1 .1 .205) (IMP DEHYDROGENASE 
2) (IMPDH-II) (IMPD 2).//1.40E-218//415aa//80%//P12269 

BGGI1200G5440//Homo sapiens snurportlnl mRNA. complete cds.//3.50E-199//360aa//99%//AF039029 
BGGI120006840//Homo sapiens sirtuin type 2 (SIRT2) mRNA, complete cds.//7.60E-1 97//371 aa//98%//AF0831 07 
BGGi120006930//POLYHOMEOTiC-PROXIMAL CHROMATIN PROTEIN.//2.30E-11//100aa//42%//P39769 
30 BGGi120010970//Homo sapiens contactin associated protein (Caspr) mRNA, complete cds.//3.50E-1 03//464aa// 
36%//U 87223 

BGGi 12001 71 40//ZiNC FINGER PROTEIN 124 (HZF-16).//1.60E-127//217aa//100%//Q15973 
BNGH4i O000030//R.norvegicus trg mRNA.//3,10E-111//361aa//60%//X68101 
BNGH41 0000130 
35 BNGH41 00001 70 
BNGH41 0000290 
BNGH41 0000330 

BNGH410000340//DIPEPTiDYL PEPTIDASE iV (EC 3,4.14.5) (DPP iV) (THYMOCYTE-ACTiVATING MOLECULE) 
(THAM).//3.20E-36//262aa//38%//P28843 
40 BNGH41 0000390//DYNEiN BETA CHAIN, CILIARY.//4.60E-136//331 aa//72%//P23098 

BNGH410000800//Homo sapiens zinc finger protein dp mRNA, complete cds,//2.80E-11//103aa//41%//AF153201 
BNGH41 0001 040 

BNGH410001180//Homo sapiens low density lipoprotein receptor related protein-deleted in tumor (LRPDIT) mRNA, 
complete cds.//0//752aa//96%//AF1 76832 
45 BNGH410001370//BRUSH BORDER61.9 KD PROTEIN PRECURSOR.//6.30E-72//555aa//31%//Q05004 
BNGH41 0001 530 

BNGH4100O1770//INOSINE-5'-MONOPHOSPHATE DEHYDROGENASE 1 (EC 1.1.1.205) (IMP DEHYDROGE- 
NASE 1) (IMPDH-I) (IMPD 1).//2.10E-270//514aa//99%//P20839 BNGH410001900//Rattus norvegicus schlafen-4 
(SLFN-4) mRNA, complete cds.//3.80E-81//56Baa//35%//AF1 68795 
50 BNGH41 0001 980//TETRACYCLiNE RESISTANCE PROTEIN, CLASS E CTETA(E)).//1 .20E-1 5//345aa//26%//Q07282 

BNGH420004740 

BNGH420005320//ZiNC FINGER PROTEIN 36 (ZINC FINGER PROTEIN KOX18) (FRAGMENT).//6.40E-179// 

322aa//99%//P 17029 
BRACE1 0000200 

55 BRAGE1 0000420//PROTEIN PHOSPHATASE 20 ABi2 (EC 3.1 .3.16) (PP2C).//2. 10E-31//202aa//37%//004719 

BRACE1 0000700 

BRACE 1 0000730//HYPOTHETICAL 37.2 KDA PROTEIN C12C2.09C IN CHROMOSOME II.//9.60E-05//1 00aa//34%// 
Q09749 
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BRACE 10000930//TNF RECEPTOR ASSOCIATED FACTOR 2 (TRAF2).//2.10E-96//197aa//92%//P39429 
BRACE10001150//NUCLEOSOME ASSEMBLY PROTEIN 1-LIKE 2 (BRAIN-SPECIFIC PROTEIN, X- LINKED).// 
3.60E-1 7//1 44aa//36%//P51 860 
BRACE1 0001 590 
5 BRACE1 0001 660 
BRACE1 0001 690 

BRACE1 0001 870//MIGROTUBULE-ASSOCIATED PROTEIN 4.//8.30E-12//49aa//81%//P27816 
BRACE20000770 
BRACE20001000 
10 BRACE20001410 

BRACE20002800//MNN4 PROTEIN.//8.50E-1 0//237aa//29%//P36044 

BRACE20003320 

BRACE20004210 

BRACE20005050 

15 BRACE20005250//DRR1 PROTEIN fTUSA PROTEIN).//5.20E-74//144aa//100%//095990 
BRACE20005450 
BRACE20005650 

BRACE20005770//Homo sapiens PHR1 isoform 4 (PHRET1 ) mRNA, alternatively spliced, complete cds.//5.80E-48// 
91aa//100%//AF093249 

20 BRACE20006980//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID),//2.40E-16//279aa// 
30%//Q01484 

BRACE20007180//CALCITONIN GENE-RELATED PEPTIDE II PRECURSOR (CGRP-II) (BETA-TYPE CGRP).// 
2.70E-61//1 27aa//99%//P1 0092 

BRACE20008850//CALDESMON (CDM)y/3.50E-08//203aa//29%//P12957 
25 BRACE20009880 
BRACE20010650 
BRACE20010700 
BRACE20011170 

BRACE20011430//Zea mays clone AGPZml arabinogalactan protein (agp) mRNA, partial cds.//3.70E-06//176aa// 
30 33%//AF 134579 

BRACE20011880 

BRACE20013400 

BRACE20013520 

BRACE20013740 
35 BRACE20013750 

BRACE20014230 

BRACE20014530//36,4 KDA PROLINE-RICH PROTEIN.//5.50E-10//102aa//34%//Q00451 

BRACE20014550//H EAT SHOCK FACTOR PROTEIN 1 (HSF 1) (HEATSHOCKTRANSCRIPTION FACTOR 1) (HSTF 
1 ) .//1 .OOE-1 1 8//229aa//997„//Q0061 3 
40 BRACE20014770//HUNTINGTIN ASSOCIATED PROTEIN 1 (HAP1),//1 .70E-22//81 aa//39%//P54256 

BRACE20014920//PROTEIN-TYROSINE PHOSPHATASE-LIKE N PRECURSOR (R-PTP-N) (ISLETCELL AUTOAN- 
TIGEN 512) (ICA512).//3.10E-42//110aa//84%//P56722 

BRACE20015080//PROTEIN-LYSINE 6-OXIDASE PRECURSOR (EC 1.4.3.13) (LYSYL OXIDASE) , //1 .30E-06// 
110aa//35%//Q05063 
45 BRACE20015430 

BRACE20016730//MUS musculus mdgl-1 mRNA, complete cds.//3.00E-54//118aa//83%//AF1 90624 
BRACE20016920 

BRACE20017370//Rvivax pval gene.//2.70E-20//99aa//49%//X92485 

BRACE20018550//B-CELL LYMPHOMA 3-ENCODED PROTEIN (BCL-3 PROTEIN).//9.20E-16//300aa//30%// 
50 P20749 

BRACE20018590//NOVEL ANTIGEN 2 (NAG-2) (TSPAN-4).//8.30E-28//69aa//91%//014B17 
BRACE20018650 

BRACE20018g80 
BRACE20019440 
55 BRACE20020500 

BRACE20020910//ZINC-FINGER PROTEIN RFP (RET FINGER PROTEIN).//1 .10E-31//91aa//49%//Q62158 

BRACE20021510 

BRACE20021760 
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BRACE20022020//SERINEyTHREONINE-PROTEIN KINASE SNK (EC 2.7.1.-) (SERUM INDUCIBLE KINASE).// 

1 .60E-41//1 02aa//47%//P53351 

BRACE20022270 

BRACE20024090//HOMEOBOX PROTEIN MEIS3 (MEISI -RELATED PROTEIN 2).//1 .50E-108//210aa//89%// 

5 P97368 

BRACE2002431 0//P53-INDUCED PROTEIN 11 .//5.00E-37//111 aa//69%//014683 

BRACE20024680//Homo sapiens GalNAc-T9 mRNA for UDP-GalNAc:polypeptlde N-acetylgalactosamlnyltrans- 
ferase, complete cds.//3.00E-153//244aa//99%//AB040672 
BRACE20024780//NEURALIZED PR0TEIN.//1 .20E-14//95aa//38%//P29503 
10 BRACE20024950 
BRAGE20025900 

BRACE20026350//SODIUM/NUCLEOSIDE COTRANSPORTER (NA(-i-)/NUCLEOSIDE COTRANSPORTER).// 
3 .40 E-25//53aa//96%//P26430 

BRACE20026850//Homo sapiens androgen-regulated short-chain dehydrogenase/reductase 1 (ARSDR1) mRNA, 
15 complete cds.//6.50E-1 20//31 3aa//72%//AF1 67438 

BRACE20027360//Homo sapiens mRNA for fnjctosamlne-3-klnase (FN3K gene)//3.00E-80//150aa//91%//AJ404615 

BRAGE20027520 

BRAGE20027550//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1).//1 .40E-1 9//1 28aa//44%//P49646 
BRAGE20027720//HYDROXYACYLGLUTATHIONE HYDROLASE (EC 3.1.2.6) (GLYOXALASE II) (GLX ll).//2.50E- 

20 35//1 33aa//50%//Q 1 6775 

BRACE20027920//L-RIBULOKINASE (EC 2.7.1 .16).//5.20E-40//387aa//29%//P94624 
BRACE20027960//Rattus norveglcus neurabin mRNA, complete cds.//2.70E-10//48aa//66%//U72994 
BRACE20028120//MUS musculus GTPase Rab37 (Rab37) mRNA, complete cds.//4.60E-48//129aa//78%//AF233582 
BRACE20028600 

25 BRACE20028610 

BRACE20028960//MUS musculus mRNA for Ca2+ dependent activator protein for secretion, complete cds.//6.10E- 

1 95//473aa//74%//D8621 4 
BRAGE20030780 

BRAGE20031100//PATGHED PROTEIN HOMOLOG 1 (PTC1) (PTC).//3.00E-23//234aa//25%//Q61115 
30 BRAGE20032850 

BRAGE20033190 

BRAGE20033980 

BRAGE20034310 

BRACE20034490 
35 BRACE20035160 

BRAGE20035270 

BRAGE20035390 

BRACE20035940 

BRACE20071380//PR0B CALCIUIVI-TRANSPORTING ATPASE 3 (EC 3.6.1.38) (ENDOPLASMIC RETICULUM 
40 CA2+-ATPASE) .//3 .50 E-65//343aa//39%//P39524 
BRACE20071530 

BRAGE20071740//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//4.90E-157//380aa//61%//P51523 
BRACE20071970 
BRACE20072010 
45 BRACE20072320 
BRACE20072810 

BRACE20074010//Oryctolagus cunlculus peroxisomal Ca-dependent solute carrier mRNA, complete cds. //2. 30E- 
1 40//383aa//65%//AF0041 61 

BRACE20074470//MUS musculus partial mRNA for mouse fat 1 cadherin (mfati gene). //3.90E-131//454aa//57%// 
50 AJ250768 

BRAGE20075020 

BRACE20075270 

BRAGE20075380 

BRACE20075630 
55 BRAGE20076210 

BRAGE20076410//Mouse mRNA for seizure-related gene product 6 type 2 precursor, complete cds.//5.10E-173// 

320aa//94%//D64009 

BRACE20076460 
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BRACE20076630//Dog nonetythroid beta-spectrin mRNA//1 .60E-05//86aa//37%//L02897 

BRACE20076850//Homo sapiens cell cycle checkpoint protein CH FR mRNA, complete cds.//1 .20E-54//1 04aa//99%// 
AF1 70724 

BRACE20077080 
5 BRACE20077270 
BRACE20077610 
BRACE20077640 
BRACE20077670 
BRACE20077680 

10 BRACE20077840//Putative Proteinthatmediatesattachmentofautophagosomes to microtubules, by similarity to yeast 
aut2 [Schizosaccharomyces pombe].//1 .00E-08//200aa//26%//CAC00556 
BRACE20077980 

BRACE20078680 PROTEIN.//2,40E-05//140aa//30%//P44678 

BRACE20078820//actln-depolymerlzlng protein N-WASP//5.50E-06//1 1 6aa//37%//S72273 
15 BRACE20079020 
BRACE20079530 
BRACE200a097Q 
BRACE20081140 
BRACE20083aOQ 

20 BRACE20083850//Rattus norveglcus N RBF-2 mRNAfor nuclear receptor binding factor-2, complete cds.//1 .30 E-1 35// 
288aa//90%//AB024930 

BRACE20084430//PROTEIN DISULFIDE iSOMERASE-RELATED PROTEIN PRECURSOR (EC 5.3.4.1) (PDIR).// 
8.30E-1 02//1 86aa//98%//Q1 4554 
BRACE20084800 
25 BRACE200B4880 
BRACE20086530 

BRACE20086550//Homo sapiens mRNAforGABAB receptor, subunit 1b.//8.30E-22//55aa//90%//AJ012186 
BRACE20087080 
BRACE200a7540 
30 BRACE20088570 
BRACE200B9600 
BRACE20089990 
BRACE20090140 

BRACE20091880//iy/lus musculus mRNAforsynaptotagmin V, complete cds.//9.90E-85//166aa//96%//AB026802 
35 BRACE20092120 
BRACE20092740 
BRAGE20092750 

BRACE20093070//P120 PR0TEIN.//1 .40E-13//121 aa//41%//P30999 

BRACE20093110 
40 BRACE20093610 
BRACE20094370 
BRACE20095170 

BRAWH1 000001 0//Homo sapiens PMEPA1 protein (PiVIEPAl) mRNA, complete cds.//2.00E-84//250aa//67%// 
AF224278 

45 BRAWH1 0000020//Homo sapiens putative hepatic transcription factor (WBSCR1 4) mRNA, complete cds.//1 .20E-1 66// 
345aa//89%//AF1 56603 
BRAWH1 0000070 

BRAWH10000370//UROKINASE PLASMINOGEN ACTIVATOR SURFACE RECEPTOR PRECURSOR (U-PAR) 
(CDB7),//9.40E-08//155aa//29%//005588 
50 BRAWH1 0000940//Xenopus iaevis mRNA for Nfrl, complete cdsy/6.1 0E-257//606aa//77%//D86491 

BRAWH1 0001 300 

BRAWH10001620//Rattus norveglcus development-related protein mRNA, complete cds//1 .90E-115//339aa//93%// 

AF045564 

BRAWH10001640//Homo sapiens KRAB zinc finger protein (RITA) mRNA, complete cds.//5.10E-14//57aa//88%// 
55 AF272148 

BRAWH1 0001 680//Homeotic protein emx2//9.60E-1 26//252aa//92%//1 51 737 
BRAWH1 0001 740 
BRAWH1 0001 800 
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BRAWH20000340//rRP-1 85 protein//7.20E-28//68aa//97%//S62356 

BRAWH20000480//MUS musculus kinesin motor protein KIFC2 mRNA, complete cds.//1.50E-120//270aa//85%// 
U92949 

BRAWH20000930 

5 BRAWH20001090//SARCALUMENIN PRECURS0R.//1 .90E-05//363aa//24%//P13666 

BRAWH20001770//SERINE HYDROXYMETHYLTRANSFERASE: MITOCHONDRIAL PRECURSOR (EC 2.1.2.1) 
(SERINE METHYLASE) (GLYCINE HYDROXYMETHYLTRANSFERASE) (SHMT).//3.20E-41//77aa//1 00%//P34897 
BRAWH20002480 

BRAWH20003230//Proline rich protein//2.00E-29//142aa//52°/y/CAA48321 
10 BRAWH20004430//Human breast cancer estrogen regulated LIV-1 protein (LIV-1) mRNA, partial cds.//1 .OOE-46// 
164aa//46%//U41060 

BRAWH20004760//IVIUS musculus mRNA for Eos protein, complete cds.//1.80E-92//180aa//92%//AB01 761 5 
BRAWH20005030//REGULATOR OF MITOTIC SPINDLE ASSEIVIBLY 1 (RMSA-1).//6.50E-1B//103aa//52%//P49646 
BRAWH20005220//Homo sapiens hD54+ins2 isoform (hD54) mRNA, complete cds.//1 .20E-77//206aa//80%// 
15 AF004430 

BRAWH20005540 

BRAWH20006330//Homo sapiens mRNA for zinc finger 2 (ZNF2 gene).//1 .20E-120//214aa//98%//X60152 
BRAWH2000651 0//HYDROXYMETHYLGLUTARYL-COA LYASE (EG 4.1 .3.4) (HMG-CCA LYASE) 
(HL) (3- HYDROXY-3-METHYLGLUTARATE-COA LYASE).//1 .60E-96//23Baa//73%//P35915 BRAWH20006860// 
20 SERINEA-HREONINE-PROTEIN KINASE SGK (EC 2.7.1 .-) 

(SERUM/GLUCOCORTICOID-REGULATED KINASE).//4.20E-223//406aa//99%//000141 

BRAWH20006970 

BRAWH2000B660 

BRAWH20008920 

25 BRAWH20009010//Human (c-myb) gene, complete primary cds, and five complete alternatively spliced cds.//9.70E- 
28//1 05aa//59%//U22376 

BRAWH20009440//Arabidopsis thaliana pollenless3 (178) gene, complete cds; beta-9 tubulin (TUB9) gene, partial 
cds; and unknown gene.//7.90E-31//271aa//34%//AF060248 

BRAWH20009840//CYTOCHROME P450 2J2 (EG 1.14.14.1) (GYPIIJ2) (ARAGHIDONIG ACID EPOXYGENASE).// 

30 1 .1 0E-146//273aa//98%//P51589 
BRAWH20011030 

BRAWH20011290//OCCLUDIN.//2.90E-07//1 74aa//29%//Q61146 
BRAWH2001 1 41 0//CUTICLE COLLAGEN 2.//2.30E-05y/129aa//35%//P1 7656 

BRAWH20011660//BETA-GALACTOSIDASE PRECURSOR (EG 3.2.1.23) (LACTASE) (ACID BETA- GALAGTOSI- 
35 DASE).//1 .40E-1 05//421 aa//49%//P1 6278 
BRAWH200 12030 

BRAWH20014180//Homo sapiens double-stranded RNA specific adenosine deaminase (ADAR3) mRNA, complete 

cds.//3.60E-97//1 79aa//1 00%//AF034837 
BRAWH20014380 
40 BRAWH20014610 

BRAWH20014840//POLYPEPTIDE N-ACETYLGALACTOSAMINYLTRANSFERASE (EC 2.4.1.41) (PROTEIN- UDP 
ACETYLGALACTOSAMINYLTRANSFERASE) (UDP-GALNAC:POLYPEPTIDE, N-AGETYLGALACTOSAMINYL- 
TRANSFERASE) (GALNAC-T1).//4.50E-84//511aa//36%//Q07537 
BRAWH20015030 

45 BRAWH20036890//MUS musculus clone mousel -9 putative protein phosphatase type 2C mRNA, partial cds.//2.70E- 

59//1 20aa//98%//AF1 1 7832 

BRAWH20036930 

BRAWH20038320 

BRAWH20040950 
50 BRAWH20047310 

BRAWH20052250 

BRAWH20059980//BONE MORPHOGENETIC PROTEIN 1 PRECURSOR (EC 3.4.24.-) (BMP-1).//8.60E-37//282aa// 

32%//P98070 
BRAWH20060440 

55 BRAWH20064500//Homo sapiens CAGF9 mRNA, partial cds.//8.00E-25//148aa//51%//U80736 
BRAWH20064930 

BRAWH20066220//DYNEIN GAMMA CHAIN, FLAGELLAR OUTER ARM.//3.20E-41//221aa//39%//Q39575 
BRAWH20069600 
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BRAWH20069890//DNA-DIRECTED RNA POLYMERASE II LARGEST SUBUNIT (EC 2.7.7.6) (RPB1) (FRAG- 
MENT) .//9.50E-07//1 88aa//30%//P1 1 41 4 
BRAWH20074060 

BRAWH20076050//LORICRIN.//2.80E-05//1 60aa//31 %//P1 81 65 
5 BRAWH20087060 
BRAWH20089030 

BRAWH20089560//Protein-tyrosine-phosphatase (EC 3.1 .3.48) 

TD 1 4//0//736aa//90%//T1 4355 

BRAWH20092270 

10 BRAWH20092610//rLM PROTEIN (TLM ONCOGENE).//3.90E-15//122aa//43%//P17408 
BRAWH20093600 
BRAWH20094850 

CD34C2000051 0//Human chltotriosidase precursor mRNA, complete cds.//7.80E-247//366aa//98%//U29615 
CTONG20003030 

15 CTONG20005890//CHANNEL ASSOCIATED PROTEIN OF SYNAPSE-110 (CHAPSYN-110).//3.10E-18//241aa// 

31%//Q15700 
CTONG20007710 
CTONG20008270 
CTONG20011390 

20 CTONG20013200//HYPOTHETICAL PROTEIN C2G11.15C IN CHROMOSOME I (FRAGMENT),//1.60E-15//130aa// 
36%//Q09814 

CTONG20013660//GLUGOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 

(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//5.00E-1 1//468aa//23%//P08640 

CTONG20015330 

25 CTONG2001 8200//MUS musculus NSD1 protein mRNA, complete cds.//0//1 061 aa//87%//AF064553 
CTONG20019110//UBIQUITIN CARBOXYL-TERMINAL HYDROLASE 4 (EC 3.1 .2.15) 
(UBIQUITIN THIOLESTERASE 4) (UBIQUITIN-SPECIFIC PROCESSING PROTEASE 4) 

(DEUBIQUITINATING ENZYME 4) (UBIQUITOUS NUCLEAR PROTEIN HOMOLOG).//5,40E-1 9//11 6aa//39%// 
Q13107 

30 CTONG2001 9550//Homo sapiens mRNA for actin binding protein ABP620, complete ods.//0//1 1 75aa//53%//AB029290 
CTONG20020730 
CTONG20021430 

CTONG20024180/yHomo sapiens scaffold attachment factor B (SAF-B) mRNA, partial cds.//3.10E-52//366aa//40%// 
L43631 
35 CTONG20024630 

CTONG20025580//ZINC FINGER PROTEIN 211 (ZINC FINGER PROTEIN C2H2-25).//2.00E-58//223aa//45%// 
01 3398 

CTONG20027210//VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN VPS13.//7.90E-06//224aa//24%// 

Q07878 
40 CTONG20028030 

CTONG20028160//Homo sapiens cadherln-like protein VR20 mRNA, partial cds.//9.70E-170//290aa//100%// 
AF1 69690 

CTONG20028200//MUS musculus MGA protein mRNA, complete cds.//0//1132aa//85%//AF205935 
CTONG20029650 

45 CTONG20037820//GAMMA-AMINOBUTYRIC-ACID RECEPTOR PI SUBUNIT PRECURSOR (GABA(A) RECEP- 
TOR).//4.30E-94//164aa//93%//000591 

CTONG20047160//Rattus norveglcus mRNA for seven transmembrane receptor, complete cds.//1.10E-26//319aa// 
29%//AB019120 

CTONG20055530//ANKYRIN 2 (BRAIN ANKYRIN) (ANKYRIN B) (ANKYRIN, NONERYTHROID).//1 .90E-59//598aa// 
50 30%//Q01484 

CTONG20064490//Drosophiia melanogaster 268 proteasome regulatory complex subunit p42A mRNA, complete 
cds.//1 .00E-41//1 08aa//77%//AF1 45308 

D30ST20001840//RNA binding motif protein 9 [Homo saplens].//1 .OOE-1 39//297aa//91%//NP_055124 
DFNES20002120//MUS musculus tgt mRNA for tRNA-guanlne transglycosylase; complete cds.//1 .40E-62//140aa// 
55 837o//AB034632 

DFNES20002680//MYOSIN HEAVY CHAIN, STRIATED MUSCLEy/9.00E-26//620aa//24%//P24733 
DFNES20002920 

DFNES20003350//CELL SURFACE GLYCOPROTEIN 1 PRECURSOR (OUTER LAYER PROTEIN B) (S-LAYER 
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PROTEIN 1).//5.30E-05//277aa//25%//Q06852 

DFNES20004320//Homo sapiens ubiquitous TPR-motif protein Y isoform (UTY) gene, partial cds; alternatively 
spliced.//3.20E-15//85aa//50%//AF265575 

FCBBF10005980//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//3.20E-151//395aa//57%//P51523 
5 FCBBF1 00061 80 
FCBBF1 0006750 

FCBBF1 0006860 

FCBBF10006870//MUS musculus Rap2 interacting protein 8 (RPIP8) mRNA, complete cds.//3.10E-113//456aa//53%// 
U73941 
10 FCBBF1 0006910 
FCBBF1 0007320 
FCBBF1 0007600 

FCBBF20000940//MONOCYTIC LEUKEMIA ZINC FINGER PROTEIN (ZINC FINGER PROTEIN 220).//2.70E-05// 
194aa//28%//Q92794 
15 FCBBF20001050 
FCBBF20001950 
FCBBF20002320 

FCBBF20002760//ALPHA SCRUIN.//2.70E-05//214aa//24%//Q25390 
FCBBF20005760 

20 FGBBF20005910//KINESIN LIGHT CHAIN (KLC).//6.40E-10//114aa//39%//P46825 
FCBBF20006770 

FCBBF20007330//DIPEPTIDYL PEPTIDASE IV LIKE PROTEIN (DIPEPTIDYL 

AMINOPEPTIDASE- RELATED PROTEIN) (DIPEPTIDYLPEPTIDASE VI) (DPPX-L/DPPX-S).//1.70E-19//96aa// 
45%//P46101 
25 FCBBF20008080 

FCBBF20008150//ZYXIN.//7.60E-57//350aa//34%//004584 
FCBBF20009400 

FCBBF20009510//ZINC FINGER PROTEIN 7 (ZINC FINGER PROTEIN KOX4) (ZING FINGER PROTEIN HR16).// 
2.30E-76//225aa//43%//P1 7097 
30 FCBBF2001 211 0//Leishmanla major partial ppgl gene for proteophosphoglycan.//3.50E-05//256aa//24%//AJ243460 
FCBBF20012990 
FCBBF20014800 

FCBBF200153B0//Homo sapiens long chain polyunsaturated fatty acid elongation enzyme (HEL01) mRNA, complete 
cds. //7. 00E-61//319aa//44%//AF231981 
35 FCBBF20016720 
FCBBF20017180 
FCBBF20017200 

FCBBF40002820//ELECTRON TRANSFER FLAVOPROTEIN BETA-SUBUNIT (BETA-ETF).//3.60E-121//239aa// 

99%//P38117 

40 FCBBF50002610//ZINC FINGER PROTEIN 35 (ZFP-35).//1 .40E-137//489aa//50%//P1 5620 

FEBRA20000350//MUS musculus MAST205 protein kinase mRNA, complete cds,//6.20E-71//190aa//76%//U02313 
FEBRA20000530//Drosophila melanogaster Diablo (dbo) mRNA, complete cds.//6,40E-58//495aa//32%//AF237711 
FEBRA20001 050//KINESIN LIGHT CHAIN (KLC).//4.20E-207//566aa//69%//007866 

FEBRA20001290//PROBABLE TRNA (5-METHYLAMINOMETHYL-2-THIOURIDYLATE)-METHYLTRANSFERASE 
45 (EC 2.1 .1 .61).//4.60E-43//82aa//1 00%//075648 

FEBRA20003110//Homo sapiens UDP-GlcNAc:a-3-D-mannoside b1 ,2-N-acetylglucosaminyltransferase 1.2 

(MGAT1.2) mRNA, partial cds.//3.80E-92//172aa//98%//AF250859 

FEBRA20003300 

FEBRA20003770//Homo sapiens ankyrin repeat-containing protein (CCM1) mRNA, complete cds.//0//406aa//100%// 
50 AF296765 

FEBRA20003780 
FEBRA20003910 

FEBRA20003970//ZINC FINGER PROTEIN 228 .//1 .60E-11 8//423aa//52%//Q9UJU3 
FEBRA20003990//ZINC FINGER PROTEIN 45 (BRC1 744).//4.60E-1 30//502aa//50%//Q02386 
55 FEBRA20004040 

FEBRA20004150//DRA PROTEIN (DOWN-REGULATED IN ADENOMA).//2.70E-30//150aa//37%//P40879 
FEBRA20004520 

FEBRA20004540//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//2.40E-252//425aa//99%//P51522 
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FEBRA20004910 

FEBRA20005360//Homo sapiens paraneoplastic cancer-testis-brain antigen (MA5) mRNA, complete cds.//5.50E-75// 
375aa//44%//AF0831 1 6 
FEBRA20006560 
5 FEBRA20006800 
FEBRA20006900 

FEBRA20007330//45 KDA CALCIUIVI-BINDING PROTEIN PRECURSOR (STROIVIA CELL-DERIVED FACTOR 4) 

(SDF-4).//3.30E-63//1 1 7aa//94%//061 1 1 2 

FEBRA20007400 

10 FEBRA20007570//Homo sapiens BM-009 mRNA, complete cds.//1 .30E-59//189aa//66%//AF208851 
FEBRA20007710 

FEBRA20007720//MUS musculus strain ICR 90 kDa actln-assoclated protein palladin mRNA, partial cds.//8.10E-06// 
1 44aa//29%//AF205079 

FEBRA20007870//Homo sapiens putative transcription factor CR53 (CR53) mRNA, partiai cds.//6.60E-79//144aa// 
15 100%//AF017433 
FEBRA20008090 

FEBRA20008560//HYPOTHETICAL 40.9 KDA PROTEIN C08B11.5 IN CHROMOSOiVIE ll.//4.30E-05//91aa//31%// 
Q09442 FEBRA20008740 

FEBRA20008800//SARCALUIVIENIN PRECURSOR.//2.10E-07//199aa//30%//P13666 FEBRA20008810//ACTIN 6 
20 (FRAGI\/1ENT).//1 ,00E-103//369aa//50%//P53459 
FEBRA20009010 
FEBRA20009590 

FEBRA20009720//ZINC FINGER PROTEIN 184 (FRAGIVIENT).//2.30E-145//514aa//51%//Q99676 
FEBRA20010930//MONOCARBOXYLATE TRANSPORTER 4 (MOT 4).//9.00E-22//333aa//28%//015374 
25 FEBRA20011330//26S PROTEASOME REGULATORY SUBUNIT S3 (PROTEASOME SUBUNIT 
P58),//2.1 0E-54//1 13aa//1 00%//043242 

FEBRA20011460//ZiNC FINGER PROTEIN 174 (AW-1)y/1 .90E-12//60aa//55%//Q15697 
FEBRA20011970 
FEBRA20012270 
30 FEBRA20012450//NAG14//4.90E-24//399aa//27%//AF196976 
FEBRA20012940 
FEBRA20013510 
FEBRA20014870 

FEBRA20014920//MUS musculus pecanex 1 mRNA, complete cds.//6.50E-120//313aa//72%//AF096286 
35 FEBRA20015840//DELTA-LIKE PROTEIN PRECURSOR (DLK) (PREAD1 POCYTE FACTOR 1) (PREF-1) (ADI- 
POCYTE DIFFERENTIATION INHIBITOR PROTEIN) [CONTAINS: FETAL ANTIGEN 1 (FA1)].//2.90E-64//323aa// 

39%//Q09163 

FEBRA20015900 

FEBRA20015910 

40 FEBRA20017060//Human APEG-1 mRNA, complete cds.//7.10E-57//113aa//100%//U57099 

FEBRA20017150//ZING-BINDING PROTEIN A33.//4.00E-1 0//322aa//21 %//Q02084 FEBRA20017900//Xenopus lae- 
vis RRiVl-containing protein SEB-4 mRNA, compiete cds.//1 .20E-79//180aa//88%//AF223427 
FEBRA20019890//HYPOTHETICAL PROTEIN KIAA0167.//1 .70E-1 80//339aa//56%//Q99490 
FEBRA20020860 

45 FEBRA20021910 
FEBRA20021940 
FEBRA20024290 

FEBRA20024420//Homo sapiens partial mRNA for choline dehydrogenase (chdh gene).//1.10E-71//143aa//98%// 
AJ272267 

50 FEBRA20025250//HYPOTHETICAL 73.0 KDA PROTEIN IN CLA4-PUS4 INTERGENIC REGI0N.//1 .40E-09//1 72aa// 

29%//P48566 

FEBRA20027270 

FEBRA20027830 

FEBRA20028820 
55 FEBRA20028970 

FEBRA20029080 

FEBRA20030540//Halocynthia roretzi mRNAforHrPET-1, complete cds.//2.80E-25//155aa//34%//AB029334 
FEBRA20031550 
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FEBRA20033080 

FEBRA20034290//RESTIN (CYTOPLASMIC LINKER PROTEIN-170) (CLIP-170).//6.90E-21//87aa//51%//042184 
FEBRA20037070 

FEBRA200411 00//PHOSPHOLIPASE ADRAB-B PRECURSOR (EC 3.1 .-.-)./n .10E-119//259aa//83%//Q05017 

5 FEBRA20041910 
FEBRA20042240 
FEBRA20042370 
FEBRA20042930 

FEBRA20043250//Canis familiaris mRNA for C3VS protein.//1 .90E-1 91//589aa//66%//X99145 
10 FEBRA20043290//MYOSIN HEAVY CHAIN, CARDIAC MUSCLE ISOFORM (FRAGMENT).//0//975aa//65%//P2961 6 
FEBRA20044120 
FEBRA20044430 

FEBRA20044900//R.norveglcus mRNA for CPG2 protein.//8.60E-244//509aa//89%//X95466 

FEBRA20045920//Homo sapiens mRNA for putative slaloglycoprotease type 2.//5.70E-1 87//273aa//98%//AJ2951 48 
15 FEBRA20048180//DRR1 PROTEIN (TU3A PROTEIN).//a.80E-58//1 31 aa//87%//095990 

FEBRA20050140//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//4.40E-125//505aa//45%//P51523 

FEBRA20050790//PROTEIN-TYROSINE PHOSPHATASE STRIATUM-ENRICHED (EC 3.1.3.48) 

(STEP) (NEURAL-SPECIFIC PROTEIN-TYROSINE PHOSPHATASE) (FRAGMENT).//3.60E-66//1 49aa//83%// 

P54829 

20 FEBRA20052160//PUTATIVE GLUTAMINE-DEPENDENT NAD(+) SYNTHETASE (EC 6.3.5.1) (NAD(+) SYNTHASE 
[GLUTAMINE-HYDROLYSING]).//6.50E-33//105aa//64%//P38795 
FEBRA20053770 

FEBRA20053800//Homo sapiens ubiquitous TPR-motif protein Y isoform (UTY) gene, partial cds; alternatively 
spliced.//3.20E-08//65aa//49%//AF265575 
25 FEBRA20054270 
FEBRA20057260 
FEBRA20057520 

FEBRA20057780//INTEGUMENTARY MUCIN A.I PRECURSOR (FIM-A.1) (PREPROSPASMOLYSIN),//2.40E-07// 

120aa//27%//P 10667 

30 FEBRA20057880//LIM domain only 7 isoform c [Homo sapiens]//1 .20E-287//545aa//99%//NP_056667 
FEBRA20059980 

FEBRA20060920//SEGMENT POLARITY PROTEIN DiSHEVELLED.//6.60E-15//84aa//41%//P51140 
FEBRA20081500 

FEBRA20062700//PUTATIVE novel haloacid dehalogenase-iii<e hydrolase family protein similar to (archaea) bacterial 
35 proteins) [Homo sapiens].//0//209aa//1 00%//CAB43550 

FEBRA20063150//Homo sapiens topoisomerase 11 aipha-4 (T0P2A) mRNA, partial cds.//8.20E-22//73aa//73%// 

AF285159 
FEBRA20063540 

FEBRA20064760//ZINC FINGER PROTEIN 184 (FRAGMENT).//2.10E-182//547aa//54%//Q99676 

40 FEBRA20066270 
FEBRA20066670 

FEBRA20067360//HYPOTHETICAL ZING FINGER PROTEIN KIAA0961 ,//7.00E-1 34//472aa//54%//Q9Y2G7 
FEBRA20067930//PERSEPHIN PRECURSOR (PSP).//7.90E-23//50aa//100%//060542 FEBRA2006B730//Trg pro- 
tein//1 .00E-B2//560aa//37%//1 60486 
45 FEBRA20069420//ZiNC FINGER PROTEIN 33A (ZINC FINGER PROTEIN K0X31) (KIAA0065) (HA0946) (FRAG- 
MENT) .//3 .60E- 1 03//284aa//58%//Q06730 

FEBRA200701 70//Homo sapiens TRAF4-assoclated factor 2 mRNA, partial cds.//1 .40E-87//220aa//75%//U831 94 
FEBRA20072000//MYOSIN 11 HEAVY CHAIN, NON MUSCLE.//3.00E-08//645aa//21%//P08799 
FEBRA20072800//Human (c-myb) gene, complete primary cds, and five complete alternatively spliced cds.//4.10E- 

50 30//97aa//74%//U22376 
FEBRA20074140 
FEBRA20074580 

FEBRA20075510//RAS-RELATED PROTEIN RAB-6.//1 .00E-36//88aa//B8%//P20340 

FEBRA20075660//REGU OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1 ).//6.00E-09//84aa//46%//P49646 
55 FEBRA20076220 

HCASM1 000021 0//Plasmodium berghei strain NYU2 merozolte surface protein-1 mRNA, partial cds.//1 .50E-08// 
1 22aa//28%//AF00041 3 

HCASM10000610//HYPOTHETICAL 63.9 KD PROTEIN C1F12.09 IN CHROMOSOME l.//2.80E-14//116aa//31%// 
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Q10351 

HCASM 10001 150 
HCASM20002020 

HCASM20002140//G1/S-SPECIFICCYCLIN D3.//8.90E-118//226aa//99%//P30281 
5 HCASM20003070 
HCASM20005340 

HCASM20005360//Macrophage migration inhibitory factor//2.50E-17//45aa//100%/'/XP_000858 

HEART20000350/n"ransacylases//7.50E-35//267aa//35%//AAB94954 

HEART20000990 

10 HEART20003090//PTB-ASSOCIATED SPLICING FACTOR (PSF).//3.70E-07//143aa//30%//P23246 
HEART20004110 

HEART20004480/yTROPONIN T, CARDIAC MUSCLE ISOFORMS CTNTC).//2.80E-39//81aa//98%//P45379 
HEART20004920 

HEART20005060//ENAMELIN (TUFTELIN).//1 .60E-23//215aa//30%//P27628 HEART20005200//ANKYRIN 1//5.00E- 
15 24//250aa//37%//P1 61 57 
HEART20005680 

HHDPG20000550//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4.3) (ATP-AMP 
TRANSPHOSPHORYLASE).//1.60E-14//201aa//24%//P43188 

HHDPC20000950//'Cricetulus griseus layilin mRNA, complete cds.//8.00E-1 77//373aa//84%//AF093673 
20 HHDPC20001150//MUS musculus putative secreted protein ZSiG37 (Zsig37) mRNA, complete cds,//2.00E-91//1 99aa// 
83%//AF 192499 

HHDPC20001490//l\/lus musculus partial mRNA for muscle protein 534 (mg534 gene).//2.10E-80//167aa//88%// 

AJ250189 

HHDPC20003150 

25 HHDPC20004650//H .sapiens PTPL1 mRNA for protein tyrosine pliosphatase.//3.60E-32//371 aa//26%//X80289 

HHDPC20004560 

HHDPC20004620 

HLUNG1 0000240 

HLUNG10000300 
30 HLUNG1 0000370 

HLUNG1 0000640//KARYOGAIVIY PROTEIN KAR4.//2.20E-27//324aa//27%//P25583 

HLUNG10000760//MUS musculus mRNA formSox7, complete cds.//9.90E-186//388aa//87%//AB023419 

H LU NG 1 0OOOggoy/TR ICHOHYALI N .//7.00E-06//454aa//21 %//P22793 

HLUNG1 0001 050//iVIYOTUBULARIN.//4.60E-1 2//95aa//37%//Q1 3496 
35 HLUNG10001100//PISTIL-SPECIFIC EXTENSIN-LIKE PROTEIN PRECURSOR (PELP).//6.20E-06//117aa//30%// 

Q03211 

HLUNG20000680//ZINC FINGER PROTEIN 157.//6.30E-104//443aa//45%//P51786 HLUNG200011 60//AIG1 PRO- 

TEIN.//3.1 0E-24//271 aa//28%//P541 20 

HLUNG20001250 

40 HLUNG20001420//MUS musculus putative thymic stromal co-transporter TSCOT mRNA, complete cds.//1 .20E-1 89// 
478aa//766//AF148145 
HLUNG20001760 

HLUNG20002550//MAST CELL TRYPTASE PRECURSOR (EC 3.4.21 .59).//1 .1 0E-43//1 78aa//47%//P50342 
HLUNG20003140 
45 HLUNG20004120 
HLUNG20004800 
HLUNG20005010 

HSYRA10001190//PROBABLE GYP7 PROTEIN (FRAGMENT).//7.90E-08//157aa//25%//P09379 
HSYRA10001370//ZINC FINGER PROTEIN 184 (FFIAGIVIENT)y/1.70E-149//556aa//50%//Q99676 

50 HSYRA1 0001 480 

HSYRA10001680//HYPOTHETICAL HELICASE C28H8.3 IN CHROMOSOME lll.//9.30E-61//540aa//32%//Q09475 
HSYRA1 0001 780 

HSYRA20001350//CELL POLARITY PROTEIN TEA1 .//9.10E-16//211aa//28%//P87061 
HSYRA20002480 
55 HSYRA20002530 
HSYRA20003470 

HSYRA20005100//NAM7 PROTEIN (NONSENSE-MEDIATED MRNA DECAY PROTEIN 1) (UP-FRAMESHIFT SUP- 
PRESSOR 1).//6.70E-31//374aa//31%//P30771 
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HSYRA20006050//MYOSIN HEAVY CHAIN, CLONE 203 (FRAGMENT).//3.40E-11//282aa//20%//P39922 
HSYRA20006290//SKIN SECRETORY PROTEIN XP2 PRECURSOR (APEG PROTEIN).//2.10E-07//168aa//30%// 
P 17437 

HSYRA20006400//Homo sapiens FRG1 mRNA, complete cds.//1 .00E-50//112aa//91%//L761 59 

5 HSYRA20007600 
HSYRA20008280 
HSYRA20011030 
HSYRA20011530 

HSYRA20013320//INSULIN-LIKE GROWTH FACTOR BINDING PROTEIN 3 PRECURSOR (IGFBP-3) (IBP-3) 
10 (IGF-BINDING PROTEIN 3).//1 .60E-130//236aa//99%//P1 7936 
HSYRA20014200 
HSYRA20014760 

HSYRA20015740//GLUCOSAMINE-6-PHOSPHATE ISOMERASE (EC 5.3.1.10) (GLUCOSAMINE-6- PHOSPHATE 
DEAMINASE) (GNPDA) (GLCN6P DEAMINASE) (OSCILLIN).//1 .70E-133//274aa//87%//Q64422 
15 HSYRA20015aOQ 
HSYRA20016210 

HSYRA20016310//ZINC FINGER PROTEIN 138 (FRAGMENT).//3.80E-136//237aa//100%//P52744 
I MR321 0000440//Human transmembrane receptor precursor (PTK7) mRNA, complete cds.//1 .60E-208//388aa//99%// 
U40271 
20 IMR321 0000740 
IMR321 0000750 

IMR3210001580//Crlcetulus griseus layllln mRNA, complete cds.//2.30E-177//373aa//84%//AF093673 
IMR321 0001 650 

IMR3210002420//ZINC FINGER PROTEIN 33A (ZINC FINGER PROTEIN K0X31) (KIAA0065) (HA0946) (FRAG- 
25 MENT). //3. 80E-81//281 aa//47%//Q06730 

IMR321 0002660//ZINC/CADMIUM RESISTANCE PROTEIN.//2.50E-1 0//1 48aa//25%//P201 07 
IMR3220002230//HINT PROTEIN (PROTEIN KINASE C INHIBITOR 1) (PKCI-1) (17 KD 
INHIBITOR OF PROTEIN KINASE C).//7.90E-08//97aa//32%//P16436 

IMR3220003020//MUS musculus shd mRNA, complete cds.//3.80E-138//337aa//77%//AB018423 

30 IMR3220006090 

IMR3220007420//HYPOTHETICAL ZINC FINGER PROTEIN ZK686.4 IN CHROMOSOME III.//5.50E-50//21 1 aa// 
48%//P34670 

IMR3220007750//FOLLISTATIN-RELATED PROTEIN PRECURSOR (TGF-BETA-INDUCIBLE PROTEIN TSC-36).// 
3.60E-1 9//229aa//30%//Q62356 
35 IMR3220007910//SYNAPSINS lA AND IB (BRAIN PROTEIN 4.1 ).//5.00E-07//167aa//31%//P17600 

IMR3220008380//METHIONYL-TRNA FORMYLTRANSFERASE, MITOCHONDRIAL PRECURSOR (EC 2.1.2.9) 

(MTFMT) (FRAGMENT).//1 ,10E-147//301aa//90%//077480 
IMR3220008590 

IMR3220008630//PUTATIVE SPLICING FACTOR, ARGININE/SERINE-RICH 2 (SPLICING 
40 FACTOR SC35) (SC-35) (SPLICING COMPONENT, 35 KD).//3.50E-05//98aa//40%//00951 1 
IMR32200091 90//METHIONINE AMINOPEPTIDASE 2 (EC 3.4.11 .18) (METAP 2) 

(PEPTIDASE M 2) (INITIATION FACTOR 2 ASSOCIATED 67 KD GLYCOPROTEIN) (P67).//1 .90E-230//41 8aa//92%// 
008663 

IMR3220009350 
45 IMR3220009530 

I M R3220009730//DN A43 P ROTEI N .//6 .20 E-1 3//367aa//23%//P32354 

IMR3220009840 

IMR3220011850 

IMR32200121B0//MUS musculus pseudourldlne synthase 3 (Pus3) mRNA, complete cds.//3.10E-221//483aa//84%// 
50 AF266505 

IMR322001 31 70//Homo sapiens mRNA for protein phosphatase, complete cds.//9.1 0E-41//1 95aa//48%//AB027004 
IMR3220013320//NEUROFILAMENT TRIPLET M PROTEIN (160 KD NEUROFILAMENT PROTEIN) (NF-M).//5.10E- 

08//357aa//22%//P08553 

I M R32200 1 4350//HYPOTH ETICAL PROTEI N KIAA0025.//9.70E-60//408aa//37%//Q1 50 1 1 
55 IMR322001 491 0//Rattusnorvegicustricarboxylatecarrler-Ilke protein mRNA, completecds.//3. 90E-43//137aa//57%// 
AF276997 
IMR3220016000 
IMR3220017240 
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KIDNE10000080//Xenopus laevis alpha-1 collagen type II' mRNA, complete cds.//8.60E-07//244aa//29%//M63596 
KIDNE1 0000280 
KIDNE1 0000500 

KIDNE1 0001 040//SYNTAXIN 7.//1 .10E-32//249aa//32%//015400 
5 KIDNE10001430 
KIDNE10001450 

KIDNE10001520//MUS musculus yolk sac permease-like molecule 1 (YSPL-1) mRNA, complete cds7/6.40E-73// 
159aa//77%//U25739 

KIDNE20000410//ALANINE--GLYOXYLATE AMINOTRANSFERASE 2 PRECURSOR (EC 2.6.1.44) (AGT 2) (BE- 
10 TA-ALAN I NE- PYRUVATE AMINOTRANSFERASE) (BETA-ALAAT ll).//1 .00E-71//1 67aa//85%//Q64565 
KIDNE20000510//ZINC FINGER PROTEIN 133.//5.10E-156//503aa//56%//P52736 
KIDNE20000700 
KIDNE20000850 

KIDNE20001670//MUS musculus mRNA for RST, complete cds.//6.80E-123//331aa//72%//AB005451 
15 KIDNE20001920 
KIDNE20002440 
KIDNE2G0G2450 
KIDNE20002660 

KIDNE2G0Q315Q//AQUAPORIN-7 LIKE (AOUAPORIN ADIPOSE) (AQPAP).//1 .B0E-44//95aa//92%//014520 
20 KIDNE20003300/lkaros-llke proteliV/l .50E-45//1 60aa//41%//AAC34387 

KIDNE20003490//MUS musculus putative lysophosphatidic acid acyltransferase mRNA, complete cds.//6.10E-124// 
291 aa//77%//AF01 5811 

KIDNE20003750//MUS musculus mRNA for granuphilin-a, complete cds.//1 .30E-31//173aa//42%//AB02525B 
KIDNE20004030 

25 KIDNE20004220//Homo sapiens topolsomerase II alpha-4 (T0P2A) mRNA, partial cds.//3.70E-21//76aa//72%// 
AF285159 

KIDN E20004970//TRICHOHYALIN.//1 .50E-06//244aa//27%//P37709 

KIDNE20005130//ALANINE--GLYOXYLATE AMINOTRANSFERASE 2 PRECURSOR (EC 2.6.1.44) (AGT 2) (BE- 
TA-ALANINE-PYRUVATE AMINOTRANSFERASE) (BETA-ALAAT ll),//1 .30E-153//337aa//B4%//Q64565 
30 KIDNE20005170//HYPOTHETICAL 49,1 KD PROTEIN C11D3.06 IN CHROMOSOME l,//2.20E-30//247aa//31%// 
01 0085 

KIDNE20005190/n"ONBPROTEIN.//2.60E-08//93aa//34%//006432 

KIDNE20005740//Staphylococcus epidermidis putative cell-surface adhesin SdrF (sdrF) gene, complete cds.//3.10E- 
34//372aa//28%//AF245041 

35 KIDNE20031850//Ras association (RalGDS/AF-6) domain family 2; KIAA0168 gene product [Homo saplens]//4.00E- 
66//250aa//59%//N P_055552 

KIDNE20033050//PUTATIVE AMIDASE AF1954 (EC 3.5.1 .4).//8. 30E-34//242aa//32%//028325 

KIDNE20033350 

KIDNE20033570 

40 KIDN E20033730//Homo sapiens Asef mRNA for APC-stimulated guanine nucleotide exchange factor, complete cds.// 
3.90E-1 84//572aa//61 %//AB0421 99 
KIDNE20033770 
KIDNE20037520 

KIDNE20039410//HYPOTHETICAL37.2 KDA PROTEIN G12C2.09G IN CHROMOSOME ll.//3.20E-19//209aa//22%// 
45 Q09749 

KIDNE20039940//ZINC FINGER PROTEIN 191.//1.40E-82//308aa//56%//014754 

KIDNE20040340 

KIDNE20040540 

KIDNE20040840//Morone saxatills myosin heavy chain FM3A (FM3A) mRNA, complete cds.//0//1 1 35aa//64%// 
50 AF003249 

KIDNE20042620 
KIDNE20042940 

KIDNE20042950//Human mRNA for prepro-alpha2(l) collagen (C0L1 A2),//1 .40E-05//96aa//37%//Y00724 
KIDNE20043440//Vacuolar protein sorting-associated protein - fission yeast//9.00E-34//400aa//33%//T39106 
55 KIDN E20044110//Homo sapiens vacuolar proton pump 116 kOa accessory subunit 

(ATP6N1B) mRNA, complete cds, alternatively spliced.//3.80E-278//322aa//91%//AF245517 

KIDNE20045200 

KIDNE20045340 
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KIDNE20045790 

KIDNE2004681 0//Mus musculus peroxisomal long chain acyl-CoA thioesterase lb (Pte1 b) gene, exon 3 and complete 
cds.//2.80E-87//21 9aa//73%//AF1 80801 

KIDNE20048280//MUS musculus orphan transporter isoform A12 (Xtrp2) mRNA, 
5 alternatively spliced, complete cds.//2,70E-265//600aa//76%//AF075262 
KIDNE20048640 
KIDNE20048790 
KIDNE20049810 

KIDNE20050420//LYSOSOMALTRAFFICKING REGULATOR (BEIGE HOMOLOG).//5.00E-97//283aa//50%//Q99698 
10 KIDNE20052960//ACTIN, CYTOPLASMIC 1 (BETA-ACTIN).//2.70E-16//68aa//67%//P12714 

KIDNE20053360//Homo sapiens antigen NY-CO-31 (NY-CO-31) mRNA, partial cds.//6.40E-12//66aa//54%// 

AF039697 

KIDNE20054000 

KIDNE20054770//Drosophilamelanogasterminidiscs(mnd) mRNA, complete cds.//4.80E-69//474aa//34%//AF139834 
'5 KIDNE20056290//BOS taurus mRNA for mitochondrial aralkyi acylGoA:amino acid N-acyltransferase.//6.40E-58// 
297aa//40%//AJ223301 

KIDNE2Q056760//NEURONAL PROTEIN.//6.50E-44//118aa//75%//P41737 
KIDNE20059080//Plakophilin 4 [Homo sapiens].//0//669aa//98%//NP_003619 
KIDNE20059370 

20 KIDNE20060140//Rattus norvegicus selective LIM binding factor mRNA, complete cds.//1 .30E-255//339aa//94%// 
AF226993 

KIDNE20060300//Gallus gallus syndesmos mRNA, complete cdsy/3.10E-42//149aa//62%//AF095446 
KIDNE20060530//MUS musculus mRNA for acetylglucosaminyltransferase-like protein.//3.70E-252//633aa//69%// 
AJ006278 
25 KIDNE20060620 

KIDNE20061490//Xenopus laevis RING finger protein mRNA, complete cds.//6.80E-19//136aa//40%//U6381 7 

KIDNE20062480 

KIDNE20062990//B0B1 PROTEIN (BEM1 -BINDING PROTEIN).//8.00E-06//332aa//22%//P38041 
KIDNE20063530 

30 KIDNE20063760//GAMMA-GLUTAMYLTRANSPEPTIDASE 1 PRECURSOR (EC 2.3.2.2) (GAMMA-GLUTAMYL- 
TRANSFERASE 1 ).//7,70E-20//62aa//83%//P1 9440 
KIDNE20066520 

KIDNE20067600//PROSTAGLANDIN F2-ALPHA RECEPTOR REGULATORY PROTEIN PRECURSOR 
(PROSTAGLANDIN F2-ALPHA RECEPTOR ASSOCIATED PR0TEIN).//1 .90E-23//293aa//26%//Q62786 
35 KIDN E20067750//Homo sapiens PTOV1 (PT0V1 ) gene, complete cds.//7.60E-62//283aa//53%//AF238381 
KIDNE20068800//ACTIN INTERACTING PROTEIN 2.//4.60E-33//143aa//51%//P46681 

KIDNE20070050 
KIDNE20070770 
KIDNE20071860 

40 KIDN E20073280//L.mexicana Imsap2 gene for secreted acid phosphatase 2 (SAP2).//3.30E-05//365aa//23%//Z46970 
KIDNE20073520//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//3.00E-1 5//536aa//23%//P08640 
KIDNE20073560 
KIDNE20074220 

45 KIDNE20075690//CLAUDIN-1 0 (OSP LIKE PROTEIN).//6.60E-90//224aa/A77%//P78369 

KIDNE20078100//DEOXYURIDINE 5'-TRI PHOSPHATE NUCLEOTIDOHYDROLASE (EC 3.6.1.23) (DUTPASE) 
(DUTP PYROPHOSPHATASE).//7.80E-1 6//1 00aa//44%//041 033 
KIDNE20078110//ZING FINGER PROTEIN 202.//3.10E-67//427aa//40%//095125 
LIVER1 0000580 

50 LIVER1 0000670//UROCANATE HYDRATASE (EC 4.2.1 .49) (UROCANASE) (IIVIIDAZOLONEPROPIONATE HYDRO- 
LASE) .//1 .50 E-87//380aa//35%//P53385 

LIVER10000790//Rattus norvegicus fertility related protein WMP1 mRNA, complete cds.//2.70E-82//254aa//65%// 

AF094609 
LIVER1 0000990 

55 LIVER10001040//Rattus norvegicus kidney-specific protein (KS) mRNA, complete cds.//6.00E-149//333aa//79%// 
AF062389 
LIVER10001110 
LIVER1 0001 750 
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LIVER10002300//NADH-UBIOUINONE OXIDOREDUCTASE 51 KD SUBUNIT PRECURSOR (EC 
1.6.5.3) (EC 1.6.99.3) (COMPLEX 1-51 KD) (CI-51KD).//1.80E-99//183aa//100%//P49821 
LIVER1 0002780 

LIVER1 0003030 

5 LIVER1 0004330//Homo sapiens mRNA for neuropathy target esterase.//1 .30E-256//71 0aa//68%//'AJ004832 
LIVER10005420//MUS musculus TAGL-alpha mRNA, complete cdsy/1 .20E-204//373aa//76%//AF1 49837 
LIVER20000330//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1 , ENDOTHELIAL (B12 PROTEIN).// 
1 .80E-1 0//1 93aa//27%//Q1 3829 

LIVER20000370//ALPHA-1 B-GLYCOPROTEIN.//5.00E-1 59//304aa//96%//P0421 7 

10 LIVER20004160 

LIVER20004460 

LIVER20005150 

MAMGLI 0000320 

MAMGL1 0000350 
15 MAMGL1 0000560 

MAMGL10001780//PUTATIVE PRE-MRNA SPLICING FACTOR RNA HELICASE (DEAH BOX 

PROTEIN 15) (ATP-DEPENDENT RNA HELICASE #46).//2.10E-BO//166aa//92%//043143 

MAMGL10001820//SEGMENT POLARITY PROTEIN DISHEVELLED.//1 .10E-12//84aa//41%//P51140 

MAMGLI 0001 840 

20 MESAN10000350//MAJOR SURFACE-LABELED TROPHOZOITE ANTIGEN PRECURS0R,//2.1 0E-06//1 79aa// 
29%//P21 849 

MESAN 10001 01 0//Rat trg gene product//6.00E-94//600aa//36%//l60486 
MESAN10001470 

MESAN10001800//BB1=mallgnant cell expression-enhanced gene/tumor progression-enhanced gene [human, 
25 UM-UC-9 bladder carcinoma cell line, mRNA, 1 897 nt].//1 .60E-1 62//348aa//87%//S82470 

MESAN20000920//Guanylate kinase-lnteracting protein 1 Maguin-1 , membrane-assoclated//7.60E-155//477aa//62%// 
T18293 

MESAN20001490//HYPOTHETICAL 175.8 KD PROTEIN IN GND1-IKI1 INTERGENIC REGION.//6.70E-163//346aa// 

58%//P38a73 
30 MESAN20002670 

MESAN20002910//HISTIDYL-TRNA SYNTHETASE (EC 6.1.1.21) (HISTIDINE— TRNA LIGASE) (HISRS).//3.60E- 

98//202aa//96%//P12081 

MESAN20003370 

MESAN20005010//Homo sapiens DNA cytoslne methyltransferase 3 alpha (DNMT3A) mRNA, complete cds.//6.60E- 
35 09//95aa//33%//AF067972 

NB9N410000470//Homo sapiens NY-REN-45 antigen mRNA, complete cds.//9.70E-247//250aa//99%//AF155110 

NB9N410001210 

NB9N410001350//RAS-RELATED PROTEIN RAB-1A (YPT1 -RELATED PR0TEIN).//1 .00E-70//109aa//100%// 

PI 1476 
40 NB9N410001460 
NB9N420000420 

NB9N420001040//MUS musculus She binding protein (mPAL) mRNA, complete cds.//4.40E-286//672aa//77%// 
AF017152 

NB9N420004950//PROBABLE NUCLEAR ANTIGEN.//5.00E-05//246aa//31 %//P33485 
45 NESOP10000870//HOMEOBOX PROTEIN SAX-1 (CHOX-3) (FRAGMENT),//1 ,70E-05//88aa//38%//P19601 

NHNPC10000840//Homo sapiens poly-U binding splicing factor PUF60 (PUF60) mRNA, partial cds.//6.80E-196// 

380aa//99%//AF1 90744 

NHNPC10001010 

NHNPC10001240//PAIRED MESODERM HOMEOBOX PROTEIN 2A (PAIRED-LIKE HOMEOBOX 2A) (PH0X2A 
50 HOMEODOMAIN PROTEIN).//B.00E-05//1 09aa//28%//Q62066 NHNPC20002060//Bovlne viral diarrhea virus type 2 
strain BVDV2-SD1630C polyprotein gene, partial cds.//8.70E-77//153aa//92%//AF268178 

NHNPC20002120//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//2.20E-130//357aa//63%//P51522 

NT2NE1 0000040 

NT2NE1 0000 140//Schizosaccharomycespombe caffeine-Induced death protein 1 (cldl)mRNA, complete cds.//1 .OOE- 
55 3 1 //350aa//29%//AF1 05076 

NT2NE1 00001 80//SUPPRESSOR PROTEIN SRP40.//2.50E-06//219aa//23%//P32583 
NT2NE1 0000230 

NT2NE10000630//Gallus gallus Dach2 protein (Dach2) mRNA, complete cds.//1 .90E-147//194aa//78%//AF1 98349 
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NT2NE10000730//RAB GERANYLGERANYLTRANSFERASE ALPHA SUBUNIT (EC 2.5.1 .-) (RAB GERANYL-GER- 
ANYLTRANSFERASE ALPHA SUBUNIT) (RAB GG TRANSFERASE) (RAB 
GGTAS E) .//3 .30 E-07//1 42aa//33%//Q92696 

NT2NE10000830//POSSIBLE GUSTATORY RECEPTOR CLONE PTE01 (FRAGMENT).//2.40E-56//182aa//62%// 

5 P35894 

NT2NE1 0001 200 
NT2NE1 0001 630 

NT2NE10001850//UDP-N-ACETYLGLUCOSAMINE-PEPTIDE N-ACETYLGLUCOSAMINYLTRANSFERASE 110 
KDA SUBUNIT (EC 2.4.1 .-) (O-GLCNAC TRANSFERASE P110 SUBUNIT).//6.80E-30//395aa//28%//P56558 
10 NT2NE20000380 
NT2NE20000560 
NT2NE20000640 
NT2NE20001740 

NT2NE20002140//DUAL SPECIFICITY PROTEIN PHOSPHATASE 8 (EC 3.1 .3.48) (EC 
15 3.1.3.16) (NEURONAL TYROSINE THREONINE PHOSPHATASE 1).//1 .00E-131//487aa//51%//009112 

NT2NE20002590//OOCYTE ZINC FINGER PROTEIN XLCOF6.1 (FRAGMENT).//6.30E-30//77aa//53%//P1 8750 
NT2NE20002990//69 KD ISLET CELL AUTOANTIGEN (ICA69) (ISLET CELL AUTOANTIGEN 1).//3.30E-113//335aa// 
57%//005084 

NT2NE20003270//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
20 (1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//7.00E-21//739aa//22%//P08640 

NT2NE20003690//PROPIONYL-COA CARBOXYLASE ALPHA CHAIN PRECURSOR (EC 6.4.1 .3) (PCCASE) (PRO- 
PANOYL-COA:CARBON DIOXIDE LIGASE).//1.80E-26//68aa//86%//P05165 

NT2NE20003840//MICRONUCLEAR LINKER HISTONE POLYPROTEIN (MIC LH) [CONTAINS: LINKER HISTONE 
PROTEINS ALPHA, BETA, DELTA AND GAMMA].//7.80E-10//366aa//22%//P40631 
25 NT2NE20003920 

NT2NE20004550//sdk (sidekick) protein//4.90E-11//1 77aa//31%//T13924 
NT2NE20004700 

NT2NE20005170//Homo sapiens ciliary dynein heavy chain 9 {DNAH9) mRNA, complete cds.//1.20E-103//226aa// 
90%//AF257737 

30 NT2NE20005360//40S RIBOSOMAL PROTEIN SA (P40) (34/67 KD LAMININ RECEPTOR) 

(COLON CARCINOMA LAMININ-BINDING PROTEIN) (NEM/1 CHD4).//1 ,50E-47//91 aa//98%//P08B65 
NT2NE20005500 

NT2NE20005860//Rattus norvegicus endo-alpha-D-mannosidase (Enman) mRNA, complete cds.//1 .70E-85//207aa// 
69%//AF023657 
35 NT2NE20006360 

NT2NE20006580//Homo sapiens mRNA for RET finger protein-like 2//1 .10E-152//2B8aa//98%//AJ010231 

NT2NE20007060 
NT2NE20007630 
NT2NE20007870 
40 NT2NE20008020 

NT2NE20008090//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//1.90E-207//511aa//71%// 

Q03923 

NT2NE20009800 
NT2NE20011560 
45 NT2NE20012470 
NT2NE20013240 

NT2NE20013370//Homo sapiens estrogen-responsive B box protein (EBBP) mRNA, complete cds.//4.30E-208// 

394aa//97%//AF096B70 

NT2NE20013640 

50 NT2NE2001 3720//Homo sapiens mRNA for putative ribuiose-5-phosphate-eplmerase, partial cds.//7.90E-58//1 1 6aa// 
98%//AJ224326 
NT2NE200 14030 

NT2NE20014280 
NT2NE20014350 
55 NT2NE20015300 
NT2NE20016230 

NT2NE20016260//Homo sapiens G-protein coupled receptor RE2 mRNA, complete cds.//2.00E-148//270aa//100%// 
AF091890 
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NT2NE20016340//NADH-UBIQUINONE OXIDOREDUCTASE 9 KD SUBUNIT PRECURSOR (EC 1.6.5.3) (EC 
1.6.99.3) (COMPLEX I-9KD) (CI-9KD).//5.50E-26//86aa//68%//P56181 
NT2NE200 16480 

NT2NE20016660//PUTATIVE ATP-DEPENDENT RNAHELICASE YDL031 W.//3.90E-15//1 76aa//30%//Q1 2389 
5 NT2NE20016970//MSF1 PROTEIN.//3.00E-23//169aa//34%//P35200 

NT2NE20034080//Rattus norvegicus neurestin alpha mRNA, complete cds.//3.70E-258//449aa//99%//AF086607 
NT2NE20035690//Hoino sapiens phosphoinositol 3-phosphate-binding protein-2 (PEPP2) mRNA, complete cds.// 
1 .60E-1 80//227aa//98%//AF3021 50 
NT2NE20044900 

10 NT2NE200471 60//Homo sapiens AD-01 7 protein mRNA, complete cds.//2.70E-91//357aa//47%//AF1 5731 8 
NT2NE20053710 

NT2NE20054410//SPLICEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//4.50E-06//121aa//33%//Q15427 
NT2NE20055170//Homo sapiens torsinA (DYT1) mRNA, complete cds.//9.50E-159//232aa//89%//AF007871 
NT2NE20057200//MUS musculus Ubc6p homolog mRNA, complete cds.//3.80E-108//222aa//91%//U93242 
15 NT2RI1 00001 60 
NT2RI1 0000270 

NT2RI100004ao//Homo sapiens MKP-1 lil<e protein tyrosine phosphatase mRNA, complete cds.//8.30E-49//1 79aa// 
50%//AF038844 
NT2RI1 0001 640 
20 NT2RI20000640 
NT2RI20002700 

NT2RI20002820//NUCLEOLAR AUTOANTIGEN N055.//5.00E-243//437aa//1 00%//Q92791 
NT2RI20002940 

NT2RI20003410//ZINC FINGER PROTEIN 43 (ZINC PROTEIN HTF6).//9.30E-226//578aa//70%//P28160 
25 NT2RI200041 20//CREB-BINDING PROTEIN.//4.40E-05//1 70aa//30%//Q92793 
NT2RI20004210//ZINC FINGER PROTEIN 75.//1 .10E-96//225aa//76%//P51815 

NT2RI20005970 

NT2RI20006690//TRICHOHYALIN.//7.10E-17//222aa//32%//P37709 
NT2RI20006710 

30 NT2RI20006850//HISTONE H1.2 (HI VAR. 1) (H1C).//3.30E-05//154aa//25%//P15864 
NT2RI200073aO 
NT2RI20008650 
NT2RI20009740 

NT2RI2001 01 00//FATTY ACYL-COA HYDROLASE PRECURSOR, MEDIUM CHAIN (EC 3.1 .2.14) (THIOESTERASE 
35 B) .//1 . 1 0 E-1 1 4//425aa//46%//Q0479 1 

NT2RI20010830//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.30E-174//554aa//51%//P51523 
NT2RI20010910//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 

3.40E-22//1 95aa//36%//Q081 70 
NT2RI20012350 
40 NT2RI20012440 

NT2RI20013420//l\/lus musculus cyclin ania-6b mRNA, partial cds.//2.70E-73//163aa//91%//AF211859 
NT2RI20013850//Homo sapiens P38IP (P38iP) mRNA. complete cds.//7.70E-101//213aa//95%//AF093250 
NT2RI2001 4090//DYSTROPHIN.//3,60E-1 5//546aa//20%//P1 1 531 
NT2RI20014100 

45 NT2RI2001 4490//MUS musculus retlnolc acld-responslve protein (Stra6) mRNA, 
complete cds.//1 .30E-263//672aa//73%//AF062476 
NT2RI2001 4500//TRICHOHYALIN.//4.80E-1 9//61 0aa//23%//Q07283 

NT2RI20015190//Homo sapiens misato mRNA, partial cds.//7.60E-149//271aa//100%//AF272833 
NT2RI2001 5400//Alcohol dehydrogenase/rlbltol dehydrogenase//4.30E-1 07//469aa//44%//AAB93456 
50 NT2RI20015950//ZeamayscloneAGPZm1 arablnogalactan protein (agp)mRNA, partial cds.//5.60E-05//180aa//32%// 

AF1 34579 

NT2RI20016210//Probable transposase - human transposon MER37//3.50E-19/'/156aa//35%//S72481 

NT2RI20016570 
NT2RI20017260 

55 NT2RI2001 8460//basic domain/leuclne zipper transcription factor//3.00E-52//203aa//59%//AAA65688 

NT2RI20018660//MUS musculus erythrold membrane-associated protein ERMAP (Ermap) mRNA, complete cds.// 
9.20E-1 87//385aa//72%//AF1 53906 

NT2RI20020220/A/ARIANT-SURFACE-GLYCOPROTEIN PHOSPHOLIPASE C (EC 3.1. 4.47) (VSG LIPASE) (GLYC- 
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OSYLPHOSPHATIDYLINOSITOL-SPECIFIC PHOSPHOLIPASE C) (GPI-PLC).//5.60E-21//271aa//27%//015886 
NT2RI20020410//SALIVARY PROLINE-RICH PROTEIN PO (ALLELE K) [CONTAINS: PEPTIDE P-D] (FRAGMENT).// 
1 .20E-05//1 27aa//32%//P1 01 62 
NT2RI20021520 
5 NT2RI20022430 
NT2RI20022520 

NT2RI20022700//X123 protein//7.20E-80//1 65aa//97%//1 68673 

NT2RI20025170//Homo sapiens PARS (PARS) mRNA, complete cds.//1 .30E-113//S73aa//45%//AF252293 
NT2R120025300//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
10 (1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//1 .40E-14//628aa//21%//P08640 
NT2R1 2002541 0//ZINC FINGER PROTEIN 135.//3.90E-72//301aa//42%//P52742 

NT2R120025540//NUCLEAR AUTOANTIGENIG SPERM PROTEIN (NASP).//1 .1 E-312//604aa//98%//P49321 
NT2R120025850//PUTATIVE 90.2 KD ZINC FINGER PROTEIN IN CCA1-ADK2 INTERGENIC REGI0N.//1 .40E-73// 
246aa//42%//P39956 
15 NT2RI20026540 
NT2RI20028020 
NT2RI2002a520 

NT2RI20029260//ARP2/3 COMPLEX 16 KDA SUBUNIT (P16-ARC).//1 .00E-52//154aa//68%//015511 
NT2RI20029580//Homo sapiens mRNA for copine VI proteln.//1 .00E-207//425aa//73%//AJ1 33798 
20 NT2RI20029700 

NT2RI20030110//MUS musculus clone:2-65 mRNA, complete cds.//2.30E-28//124aa//50%//AB030198 

NT2RI20030190 

NT2RI20030510 

NT2RI20030670 

25 NT2RI20031540//DXS6673E PROTEIN.//2.80E-05//240aa//22%//Q14202 NT2RI20032050//Homo sapiens 
transportin2 mRNA, complete cds.//0//663aa//98%//AF019039 

NT2RI20032220//INTRACELLULAR PROTEIN TRANSPORT PROTEIN US01 .//4.90E-16//582aa//23%//P25386 
NT2RI200S3010//Homo sapiens UDP-GlcNAc:a-3-D-mannoslde b1,2-N-acetylglucosamlnyltransferase 1.2 
(MGAT1 .2) mRNA, partial cds.//1 .40E-293//579aa//93%//AF250859 
30 NT2RI20033040 
NT2RI20033380 

NT2RI20033440//PRESYNAPTIC PROTEIN SAP97 (SYNAPSE-ASSOCIATED PROTEIN 97) (DISCS, LARGE HO- 
MOLOG 1 ).//2.20E-0B//1 2Baa//36%//Q1 2959 

NT2R120033830//Homo sapiens SGC32445 protein (SGC32445) mRNA, complete cds.//1 .80E-67//134aa//100%// 
35 AF251041 

NT2RI20035560 

NT2RI20036780//SERINE PROTEASE PG6 PRECURSOR (EC 3. 4. 21. -) (SUBTILISIN/KEXIN-LIKE PROTEASE 
PCS) (CONVERTASE PC5).//0//633aa//97%//P41 41 3 
NT2RI20036950//TRICHOH YALIN.//1 .70E-1 3//31 3aa//25%//P37709 
40 NT2RI200S7510//FORMAMIDOPYRIMIDINE-DNA GLYCOSYLASE (EC 3.2.2.23) (FAPY-DNA GLYCOSYLASE).// 
1 .20E-05//238aa//28%//P74290 
NT2RI20040590 

NT2RI20041900//REGULATOR OF MITOTIC SPINDLE ASSEMBLY 1 (RMSA-1).//3.20E-13//108aa//44%//P49646 
NT2RI20042840 

45 NT2RI20043040//Homo sapiens NY-REN-2 antigen mRNA, complete cds.//2.80E-188//539aa//65%//AF1 56095 

NT2RI20043980 

NT2RI20044420 

NT2RI20046060 

NT2RI20047830 
50 NT2RI20048400 

NT2RI20049160 

NT2RI20049840 

NT2RI20049850 

NT2RI20050610 

55 NT2RI20050870//Homo sapiens putative anion transporter 1 mRNA, complete cds.//8.10E-262//533aa//96%// 

AF279265 

NT2RI20051500//MUS musculus ST6GalNAc V mRNA for GDI alpha synthase, complete cds.//2.40E-168//336aa// 
90%//AB030836 
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NT2RI20053350//DNA REPAIFVTRANSCRIPTION PROTEIN MET18/MMS19.1/2.00E-23//234aa//32%//P40469 
NT2RI20053680//Homo sapiens NY-REN-36 antigen mRNA, partial cds.//5.80E-62//124aa//100%//AF155106 
NT2RI20055640//MUS musculus mRNA for ganglioside-induced differentiation associated protein 1.//2.50E-100// 

319aa//58%//Y17850 
5 NT2RI20056280 

NT2RI20056470//KERATIN, TYPE II CYTOSKELETAL 4 (CYTOKERATIN 4) (K4) (CK4).//2.40E-278//534aa//99%// 

P19013 

NT2RI20057230//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 
3.00E-28//241 aa//36%//Q081 70 
10 NT2RI20058110//CELL DIVISION CONTROL PROTEIN 25.//3.30E-18//419aa//25%//P04821 
NT2RI2005B510//PLECTIN,//3.50E-07//551aa//21%//P30427 

NT2RI20060710//ZINC FINGER PROTEIN ZIC4 (ZINC FINGER PROTEIN OF THE CEREBELLUM 4).//2.10E-153// 
312aa//86%//Q61467 

NT2RI20060720//HYPOTH ETICAL PROTEIN KIAA01 79.//0//692aa//99%//Q1 4684 
15 NT2RI20061270 

NT2RI20061830//Proline-rich protein M14 precursor//1.50E-17//170aa//37%//A28996 
NT2RI200621 QO//Mus nnusculus shd mRNA, complete cds.//1 .00E-137//337aa//77%//AB018423 
NT2RI20063450 

NT2RI20064120//Rattus non/egicus mRNAfor CDCrel-1A, complete cds.//6.70E-148//280aa//98%//AB0271 43 
20 NT2RI20064870 

NT2RI20065060//Drosophila melanogaster rudimentary gene, intron 3; anon-15AB gene, complete cds.//1 .40E-07// 
21 2aa//23%//AF1 72941 
NT2RI20065530 
NT2RI20066670 
25 NT2RI20066790 

NT2RI20066820//Human WW domain binding protein-l mRNA, complete cds.//5.70E-46//181aa//46%//U79457 
NT2RI20067030//Homo sapiens nolp mRNA, complete cds.//1 .80E-85//406aa//51%//AB017800 
NT2RI20067350//Neofelis nebulosa strain nnex zinc finger protein Zfx (Zfx) gene, partial cds.//2.1 0E-23//245aa//27%// 

AF252979 
30 NT2RI20067880 
NT2RI20068250 

NT2RI20068550//Homo sapiens RNA helicase (RIG-I) mRNA, complete cds.//1 .20E-52//340aa//34%//AF038963 
NT2RI20070480//l\/lus musculus DXImx48e protein (DXImx48e) mRNA, complete cds.//3.40E-169//467aa//72%// 
AF229644 
35 NT2RI20070840 

NT2RI20070960//PROTO-ONCOGENE DBL PRECURSOR [CONTAINS: MCF2].//4. 10E-30//431aa//27%//P10911 

NT2RI20071160 

NT2RI20071330//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//2.B0E-240//647aa//64%// 

Q05481 
40 NT2RI20071480 
NT2RI20072140 

NT2RI20072540//Arabidopsis thaliana ZCF61 mRNA, complete cds.//2.20E-12//11 3aa//35%//AB028228 
NT2RI20073030 

NT2RI20073840//Homo sapiens mixed lineage kinase mRNA, complete cds.//2.30E-179//362aa//93%//AF238255 
45 NT2RI20073860 

NT2RI20074390/yZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//2.30E-97//489aa//41%//P51523 
NT2RI20074690//Homo sapiens NY-REN-58 antigen mRNA, complete cds.//4.00E-221//438aa//99%//AF155115 
NT2RI20074980//Homo sapiens carboxypeptidase Z precursor, mRNA, complete cds.//3.10E-189//357aa//96%// 
U83411 

50 NT2RI20075070 
NT2RI20075720 
NT2RI20075890 

NT2RI20077230//Homo sapiens BRI3 mRNA, complete cds.//1 .80E-114//1 82aa//99%//AF272043 
NT2RI20077290 
55 NT2RI20077510 

NT2RI20077540//INTESTINAL MEMBRANE A4 PROTEIN (DIFFERENTIATION-DEPENDENT 
PROTEIN A4) (PROTEOLIPID PROTEIN 2).//2.30E-12//111aa//33%//Q04941 
NT2RI20078270 
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NT2RI2007B790//HOMEOBOX PROTEIN H0X-A4 (CHOX-1.4).//7.80E-08//83aa//43%//P17277 
NT2RI20078840//ARS BINDING PROTEIN 1 .50E-17//313aa//27%//P49777 
NT2RI20078910//DMR-N9 PROTEIN.//1 .40E-122//398aa//59%//Q08274 

NT2RI20080500//BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE 
5 PROTEIN PRECURSOR (HSPG) (PERLECAN) (PLC).//5.90E-43//528aa//28%//P981 60 

NT2RI20081880//MUS musculus Mporc-b mRNAfor porcupine-B, complete cds.//4.40E-64//125aa//97%//AB036746 
NT2RI20082210//CORNIFIN B (SMALL PROLINE-RICH PROTEIN 1B) (SPR1B) (SPR1 B).//4.70E-1 2//11 0aa//37%// 
Q62267 

NT2RI20083360 

10 NT2RI20083960//Homo sapiens mRNA for SH3 binding protein, complete cds.//3.60E-31//159aa//44%//AB005047 
NT2RI20084810//1-ACYL-SN-GLYCEROL-3-PHOSPHATE ACYLTRANSFERASE ALPHA (EC 2.3.1.51) (1- AGP 
ACYLTRANSFERASE) (1-AGPAT) (LYSOPHOSPHATIDIC ACID 
ACYLTRANSFERASE- ALPHA) (LPAAT-ALPHA).//4.70E-60//11 4aa//99%//Q99943 

NT2RI20085260 

15 NT2RI200859aO//MILK FAT GLOBULE-EGF FACTOR 8 PRECURSOR (MFG-EB) (MGP57/53) 

(PAS- 6/PAS-7 GLYCOPROTEIN) (MFGM) (SPERM SURFACE PROTEIN SP47) (BP47) (COMPONENTS 15/16).// 

5 .40 E-3 1 //1 74aa//41 %//Q95 1 1 4 

NT2RI20086560 

NT2RI200871 40//Homo sapiens PR-domain zinc finger protein 5 (PRDM5) mRNA, complete cds.//6.1 0E-51//1 01 aa// 
20 99%//AF272897 

NT2Ri20087490//SPLiCEOSOME ASSOCIATED PROTEIN 49 (SAP 49) (SF3B53).//2.00E-08//156aa//37%//Q15427 
NT2Ri20087910//PROBABLE URACIL PHOSPHORIBOSYLTRANSFERASE (EC 2.4.2.9) (UMP PYROPHOSPHOR- 
YLASE) (UPRTASE).//1 .80E-42//159aa//54%//0 13867 
NT2RI20088010 

25 NT2RI2008B1 20//AXONEME-ASSOCIATED PROTEIN MST1 01 (2).//4.00E-05//1 85aa//24%//Q08696 

NT2RI20089420//NEURAL CELL ADHESION MOLECULE LI PRECURSOR (N-CAM L1).//3.00E-07//104aa//31%// 
Q05695 

NT2Ri20090650//ZINC FINGER PROTEIN 26 (ZFP-26) (MKR3 PROTEIN) (FRAGMENT).//2.10E-30//194aa//37%// 
PI 0076 

30 NT2Ri20090660//PLEGTIN.//7.20E-14//450aa//24%//P30427 
NT2Ri20090830 
NT2Ri20091440 

NT2RI20092150//ZiNG FINGER PROTEIN 165.//3.10E-46//179aa//56%//P49910 

NT2RI20092890//CARBOXYPEPTIDASE N 83 KD CHAIN (CARBOXYPEPTIDASE N REGULATORY SUBUNIT) 
35 (FRAGMENT).//3.70E-29//293aa//31%//P22792 

NT2RI20094060//Homo sapiens reo mRNA, complete cds.//1 .40E-112//293aa//62%//AB023584 
NT2RP60000080//Homo sapiens Pig11 (PIG11) mRNA, complete cds.//6.60E-38//117aa//71%//AF01 03 15 

NT2RP60000170 
NT2RP60000320 

40 NT2RP60000350//Homo sapiens mRNA for SH3 binding protein, complete cds.//1 ,30E-54//253aa//45%//AB005047 
NT2RP60000390 
NT2RP60000590 

NT2RP60000720//Pinus taeda clone PtaAGPB putative arabinogalactan protein mRNA, complete cds,//1 .50E-05// 
165aa//29%//AF101785 

45 NT2RP60000860//Homo sapiens mRNA for NICE-5 protein. //6.30E-1 92//883bp//99%//AJ243666 
NT2RP60001000//ZINC FINGER PROTEIN 41 (FRAGMENT).//7.10E-128//366aa//59%//P51814 
NT2RP60001090//RING CANAL PROTEIN (KELCH PROTEIN).//6.40E-79//553aa//33%//Q04652 
NT2RP60001230//KINESIN LIGHT CHAIN (KLC).//1 .10E-206//566aa//69%//Q07866 

NT2RP60001270//ZINC FINGER PROTEIN ZIC4 (ZINC FINGER PROTEIN OF THE CEREBELLUM 4).//3.70E-131// 
50 264aa//67%//Q61467 
NT2RP70000410 

NT2RP70000690//MUCiN 1 PRECURSOR (POLYMORPHIC EPITHELIAL MUCIN) (PEM) (PEMT) (EPiSIALIN) (TU- 
MOR-ASSOCIATED MUCIN) (CARCINOMA-ASSOCIATED MUCIN) (TUMOR-ASSOCIATED EPITHELIAL MEM- 
BRANE ANTIGEN) (EMA) (H23AG) (PEANUT- REACTIVE 
55 URINARY MUCIN) (PUM) (BREAST CARCINOMA-ASSOCIATED ANTIGEN DF3).//2.00E-26//911aa//25%//P15941 
NT2RP70000760//TRANSLATION INITIATION FACTOR EIF-2B EPSILON SUBUNIT (EIF-2B GDP-GTP EXCHANGE 
FACTOR) .//1 .70E-23//66aa//81 %//P47823 

NT2RP70002380//Adlantum caplllus-venerls AcExtl mRNA for Extensin, complete cds.//2.00E-05//93aa//37%// 
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AB008227 

NT2RP70002590//HYPOTHETICAL32.0KDA PROTEIN IN NNF1-STE24INTERGENICREGION.//2.20E-12//251aa// 
28%//P47153 

NT2RP70002710//MUS musculus zinc finger protein 276 C2H2 type (Zfp276) mRNA, complete cds.//1.80E-101// 

5 253aa//77%//AF1 78935 
NT2RP70003640 
NT2RP70003910 

NT2RP70004250//MYOSIN HEAVY CHAIN, NONMUSCLE TYPE B (CELLULAR MYOSIN HEAVY CHAIN, TYPE B) 
(NMMHC-B).//4.30E-08//244aa//24%//P35580 
10 NT2RP70004770//UDP-N-ACETYLGLUCOSAMINE-PEPTIDE N-AGETYLGLUCOSAMINYLTRANSFERASE 110 
KDA SUBUNIT (EG 2,4.1 .-) (O-GLCNAC TRANSFERASE P110 SUBUNIT).//1 .00E-22//213aa//32%//P56558 
NT2RP70005790 

NT2RP70006240//Phosphatidylinositol-4-phosphate 5-klnase homolog T3K9.2//1 .90E-1 6//204aa//30%//T02098 
NT2RP70008120//HOMEOBOX PROTEIN H0X-B9 (HOX-2.5).//2.40E-53//117aa//87%//P20615 
15 NT2RP70009060//Medicago tmncatula mRNA for B5p protein (85p gene).//5.10E-07//229aa//23%//AJ249679 
NT2RP70010800//MUS musculus mRNA for Mill (Miwi like), compiete cds.//2.40E-280//614aa//83%//AB032605 
NT2RP70011660//P CATION-TRANSPORTING ATPASE C10C6.6 iN CHROMOSOIVIE iV (EC 3,6.1.-). //0//1i65aa// 
53%//P90747 
NT2RP70012310 

20 NT2RP70013060//U1 SMALL NUCLEAR RIBONUCLEOPROTEiN 70 KDA (U1 SNRNP 70 
KDA)y/1 .50E-30//241 aa//32%//'P09406 
NT2RP70013350 

NT2RP70015910//bK57G9.1 (novel Kringle and CUB domain protein) [Homo sapiens].//1 .00E-140//247aa//95%// 
CAB62952 

25 NT2RP7001 8560//MUS musculus polyhomeotic (mPh2) mRNA, complete cds.//5.00E-232//465aa//91 %//U81 491 
NT2RP70021510 

NT2RP70022430/A"ax1 -binding protein TRX - human.//6.00E-71//180aa//93%//S68091 
NT2RP70023760//M PROTEIN, SEROTYPE 2.1 PRECURSOR.//2.20E-13//331aa//25%//P50468 
NT2RP70023790//110 KDA ANTIGEN (PK110) (FRAGMENT).//7.00E-07//162aa//23%//P13B13 

30 NT2RP70024490 

NT2RP70024500//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2),//4,80E-66//312aa//38%//P51523 
NT2RP70025540 

NT2RP70026190//MUS musculus ubiqultln-protein ligase E3-alpha (Ubri) mRNA, complete cds.//6.20E-305//597aa// 
93%//AF061555 

35 NT2RP70028290//Scm-related gene containing four mbt domains [Mus musculus].//6.00E-59//500aa//31%// 
NP_062333 
NT2RP70028410 

NT2RP70028750//RESTIN (CYTOPLASMIC LINKER PROTEIN-170 ALPHA-2) (CLIP-170) 
(REED- STERNBERG INTERMEDIATE FiLAMENT ASSOCIATED PROTEIN).//2.60E-11//87aa//44%//P30622 
40 NT2RP70029060//HEAT SHOCK PROTEIN MSP 90-ALPHA (MSP 

86).//0//731 aa//99%//P07900 

NT2RP70029820//Homo sapiens GR0S1-L protein mRNA, complete cds.//2.40E-177//680aa//51%//AF097432 
NT2RP70030500 
NT2RP70030550 
45 NT2RP70030910 

NT2RP70032030//ZING FINGER PROTEIN 184 (FRAGMENT).//3.50E-139//366aa//55%//Q99676 
NT2RP70033040//YceA protein homolog ybfQ - Bacillus subtills.//1 .00E-35//300aa//33%//G69750 
NT2RP70036290//MHG CLASS II TRANSACTIVATOR CIITAy/4.80E-09//116aa//31%//P33076 
NT2RP70036320//Mlcrofllarlal sheath protein//5.00E-06//92aa//35%//S46966 
50 NT2RP70036470 

NT2RP70036800//RING CANAL PROTEiN (KELCH PROTEIN),//4,50E-1 07//652aa//38%//Q04652 
NT2RP70039600 

NT2RP70040800//CELL SURFACE GLYCOPROTEIN 1 PRECURSOR (OUTER LAYER PROTEiN B) (S-LAYER PRO- 
TEIN 1 ) ,//5,60E-20//307aa//28%//Q06852 
55 NT2RP70042040//ZINC FINGER PROTEIN MLZ-4 (ZiNC FiNGER PROTEiN 46),//9,20E-61//254aa//46%//Q03309 
NT2RP70042330//HYPOTHETICAL PROTEIN MJ0941 .//8.80E-06//133aa//24%//057711 

NT2RP70042600//MYOSIN HEAVY CHAIN, NONMUSCLE TYPE B (CELLULAR MYOSIN HEAVY CHAIN, TYPE B) 
(NM M HC-B) .//4.50 E-2 1 //71 5aa//21 %//P35580 
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NT2RP70043730 

NT2RP70043960//MUS musculus mRNA for Fish protein.//0//866aa//88%//AJ007012 
NT2RP70045410//MUS musculus MGA protein mRNA, compiete cds.//2.70E-265//1040aa//57%//AF205935 
NT2RP70046560//PEREGRIN (BR140 PR0TEiN).//1 .30E-48//304aa//36%//P55201 
5 NT2RP70046870/yVEGETATiBLE iNCOMPATIBiLITY PROTEIN HET-E-1 .//9.70E-07//395aa//24%//Q00808 
NT2RP70047510 
NT2RP70047660 
NT2RP70047900 

NT2RP70049150//MUS musculus mRNA for UBE-1c1, UBE-1c2, UBE-1c3, complete cds.//6.60E-56//209aa//52%// 
10 AB030505 

NT2RP70049250/A/ASODILATOR-STIMULATED PHOSPHOPROTEIN (VASP).//1 .90E-08//1 1 8aa//33%//P50552 
NT2RP70049750 

NT2RP70052050//Human transformation-related protein mRNA, 3' end.//2.20E-12//74aa//52%//L24521 
NT2RP70052190 
15 NT2RP700546BO 
NT2RP70054930 

NT2RP70055020//Homo sapiens mRNA for paraplegln-IIke proteln.//3.00E-29//68aa//94%//Y18314 
NT2RP70055130//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//7.00E-130//461aa//46%//P51523 
NT2RP70055200//INTEGUMENTARY MUCIN A.1 PRECURSOR (FIM-A.1) (PREPROSPASMOLYSIN).//2.40E-07// 
20 120aa//27%//P 10667 

NT2RP70061620//ZINC FINGER PROTEIN MFG-3.//3.60E-16//266aa//27%//P16374 
NT2RP70061 8B0//GTPASE-ACTIVATING PROTEIN.//1 .70E-08//265aa//21 %//P33277 

NT2RP70062960//EXCISION REPAIR PROTEIN ERCC-6 (COCKAYNE SYNDROME PROTEIN CSB).//5.80E-67// 
185aa//46%//Q03468 

25 NT2RP70063040//Homo sapiens MLL septln-IIke fusion protein (MSF) mRNA, complete cds.//1.00E-187//348aa// 
99%//AF1 23052 
NT2RP70063740 

NT2RP70064080//Drosophila melanogaster F protein (olf186) mRNA, complete cds,//2.50E-54//203aa//56%// 
AF1 88634 

30 NT2RP70064900//ZING FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//1 .20E-149//580aa//40%//P51523 
NT2RP70065270//LIM-ONLY PROTEIN 6 (TRIPLE LIM DOMAIN PROTEIN 6).//2.40E-136//341aa//61%//043900 
NT2RP70066210 
NT2RP70067010 
NT2RP70069800 

35 NT2RP70069860//ZINC FINGER PROTEIN 1 84 (FRAGMENT).//4.40E-1 41//566aa//46%//Q99676 
NT2RP70071140 
NT2RP70071540 

NT2RP70071 770//Homo sapiens multiple membrane spanning receptor TRCB (TRCB) mRNA, complete cds.//8.50E- 
61 //599aa//29%//AF064801 

40 NT2RP70072210//Rattus norvegicus sclilafen-4 (SLFN-4) mRNA, complete cds,//1 . 10E-19//244aa//32%//AF 168795 
NT2RP70072520//Human serine/tlireonine kinase mRNA, partial cds,//1 ,20E-79//154aa//100%//U79240 
NT2RP70073590 

NT2RP7007381 0//Sulfonylurea receptor 2A//4.50E-70//1 35aa//1 00%//NP_064694 
NT2RP70074060 

45 NT2RP70074220//SYNAPSIN I (BRAIN PROTEIN 4.1).//2.20E-05//116aa//34%//P17600 
NT2RP70075040 

NT2RP70075370//52 KDA RO PROTEIN (SJOGREN SYNDROME TYPE A ANTIGEN (SS-A)) (RO (SS-A)).//1 .20E- 
97//482aa//42%//P1 9474 

NT2RP70076100//Homo sapiens mRNA for putative phosphollpase, complete cds.//7.10E-189//424aa//53%// 
50 AB0 19435 

NT2RP70076170 

NT2RP70076430//PUTATIVE IMPGRTIN BETA-4 SUBUNIT (KARYOPHERIN BETA-4 

SUBUN IT).//1 .70E-42//692aa//26%//0601 00 

NT2RP70079250//Homo sapiens contactin associated protein (Caspr) mRNA, complete cds.//2.30E-218//963aa// 

55 427o//U 87223 

NT2RP70079300 

NT2RP70079750//Homo sapiens BAC526N1 8 neurexln III gene, partial cds.//5.00E-177//334aa//100%//AF1 23462 
NT2RP70081330 
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NT2RP70081370//ATP-BINDING CASSETTE, SUB-FAMILY A, MEMBER 1 (ATP-BINDING 
CASSETTE TRANSPORTER 1) (ATP-BINDING CASSETTE 1).//1 .10E-56//364aa//36%//P41233 
NT2RP70081420 

NT2RP70081440//DUAL SPECIFICITY MITOGEN-ACTIVATED PROTEIN KINASE KINASE 4 (EC 2.7.1. ) (MAP Kl- 

5 NASE KINASE 4) (JNK ACTIVATING KINASE 1) (C-JUN N- TERMINAL 

KINASE KINASE 1) (JNKK) (SAPK/ERK KINASE 1) (SEK1).//7.20E-77//1 62aa//91 %//P45985 
NT2RP70081670//85.1 KDA PROTEIN IN GREB-FEOA INTERGENIC REGION.//3.1 OE-1 08//568aa//38%//P46837 
NT2RP70083150//ENVELOPE GLYCOPROTEIN GP340 (MEMBRANE ANTIGEN) (MA) [CONTAINS: GLYCOPRO- 
TEIN GP220].//3.60E-09//431 aa//23%//P03200 

10 NT2RP70084060//Probable hexosyltransferase (EC 2.4.1 .-) SC2G5.06//4.90E-07//1 27aa//32%//T34839 
NT2RP70084410//Polybromo 1 protein - chicken //0//985aa//88%//S6067B 

NT2RP70084870//TRICHOHYALIN.//9.20E-16//452aa//21%//P37709 NT2RP70085500//MUS musculus rig-1 protein 
mRNA, complete cds.//0//976aa//86%//AF060570 

NT2RP70085570//DNA BINDING PROTEIN URE-B1 (EC 6.3.2.-).//3.40E-23//282aa//27%//P51593 
15 NT2RP70086230 

NT2RP70087200//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//1 .20E-1 64//689aa//42%//P51 523 
NT2RP70088550//Rattus norvegicus membrane-associated guanylate kinase-interacting protein 2 Maguin-2 mRNA, 
complete ccls.//7.90E-267//434aa//98%//AF1 02854 

NT2RP70090i20//CHLORIDE CHANNEL PROTEIN 7 (CLC-7) (FRAGMENT)7/0//734aa//99%//P51 798 
20 NT2RP70090190//ZINC FINGER PROTEIN 83 (ZINC FINGER PROTEIN HPF1).//3.60E-146//395aa//61%//P51522 
NT2RP70091 490//GLUCOSE TRANSPORTER TYPE 2, LIVER.//3.30E-1 7//1 09aa//37%//P14246 
NT2RP700916B0 
NT2RP70092150 

NT2RP70092360//BASEMENT MEMBRANE-SPECIFIC HEPARAN SULFATE PROTEOGLYCAN CORE 
25 PROTEIN PRECURSOR (HSPG) (PERLECAN) (PLC).//4.50E-91//1310aa//26%//Q05793 
NT2RP70092590 

NT2RP70093220//CHLORIDE CHANNEL PROTEIN 5 (CLC-5).//0//746aa//99%//P51795 NT2RP70093630 
NT2RP70093700//PUTATIVE SERINE/THREONINE-PROTEIN KINASE PKWA (EC2.7.1.).//9.90E-11//241aa//21%// 
P49695 
30 NT2RP70093730 

NT2RP70093940//Exocyst complex protein sec5//0//924aa//93%//T09220 

NT2RP70093970 

NT2RP70094290 

NT2RP70094660 

35 NT2RP7009481 0//Drosophila melanogaster Dispatched (dispatched) mRNA, complete cds.//1 .OOE-1 05//579aa//38%// 
AF200691 

NT2RP70094980//FIBULIN-1, ISOFORM A PRECURSORy/6.50E-30//211aa//38%//P23142 

NT2RP70095020 
NT2RP70095070 
40 NT0NG1 0000330 

NTONG10000520//Rattus norveglcus mRNA for Kelch related protein 1 (krpi gene).//2.00E-118//439aa//49%// 

AJ293948 

NT0NG1 0000980 

NTONG10001230//MUS msuculus mRNA, partial cds, clone CLFEST42 //6.1 0E-07//217aa//25%//D82816 
45 NTONG10001300//Gallus gallus kinectin mRNA, complete cds.//1 .30E-15//534aa//22%//U15617 

NTONG10001820//MUS musculus mammalian Inositol hexakisphosphate kinase 1 (Ip6k1) mRNA, complete cds.// 
5.20E-77//294aa//53%//AF1 771 44 

NTONG10002140//SARCALUMENIN PRECURSOR.//3.50E-204//376aa//97%//P13666 

NTONG10002460//CYCLIN-DEPENDENT KINASE INHIBITOR 1C(CYCLIN-DEPENDENT KINASE INHIBITOR P57) 
50 (P57KIP2).//1 .10E-16//156aa//40%//P49918 

NT0NG1 0002570 

NTONG10002640//HYPOTHETIGAL71.1 KD PROTEIN IN DSK2-CAT8 INTERGENIC REGION.//6.90E-98//603aa// 

39%//Q03262 

NTONG20002650//PrQbable transmembrane protein of fission yeas1//8.50E-63//539aa//28%//r39483 
55 NTONG20003340//ZINC FINGER PROTEIN 90 (ZFP-90) (ZINC FINGER PROTEIN 
NK1 0).//3.1 OE-1 08//225aa//83%//061 967 

NTONG20003630//CREB-BINDING PROTEIN. //3.00E-05//160aa//31%//Q92793 
NTONG20004920 
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NTONG20005830 
NTONG20008000 

NTONG20008780//MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN B) (CENP-B).//1 .40E-42// 

330aa//31%//P27790 

5 NTONG20009660//MUS musculus N-RAP mRNA, complete cds.//2.50E-56//393aa//38%//U7661 8 

NTONG20009850 
NTONG20011370 
NTONG20012220 
NTONG20014280 

10 NTONG20015500//ZINC FINGER PROTEIN 135.//1 .40E-128//340aa//64%//P52742 NTONG2001 61 20//Oxystyrol- 

blndlng protein homologue 1 [Mus musculus domestlcus].//3.00E-43//342aa//37%//AJ278263 

OCBBF1 0000420 

OCBBF1 0000670 

0CBBF1 0000860 
15 0CBBF1 000091 Q//SORBIN.//1.30E-71//145aa//91%//P2B220 

OCBBF1 0001 040 

OCBBF1 00011 80//TUMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1 , ENDOTHELIAL (B12 PROTEIN).// 
1 .OOE-1 2//1 24aa//37%//Q1 3829 
OCBBF1 0001 190 

20 0CBBF1 0001 220//RING CANAL PROTEIN (KELCH PROTEIN).//8.50E-32//274aa//31 %//Q04652 
OCBBF20000130 
OCBBF20001260 

OCBBF20002310//PHOSPHOLIPASE A2 INHIBITOR SUBUNIT B PRECURSOR (PLI-B).//3.50E-27//307aa//29%// 
093233 

25 OCBBF20002770//EARLY EMBRYOGENESIS ZYG-11 PROTEIN.//1 .40E-46//348aa//34%//P21541 
OGBBF20002870 

OCBBF200071 90//Putatlve cleavage and polyadenylatlon speclflty factor [Arabldopsis thallana].//1 .OOE-1 42//450aa// 
53%//AAD12712 

OCBBF20008240//THREONYL-TRNA SYNTHETASE, CYTOPLASMIC (EC 6.1.1.3) (THREONINE-TRNA LIGASE) 
30 (TH R RS) .//1 . 1 0 E-244//484aa//77%//P26639 
OCBBF20009040 
OCBBF20009980 
OCBBF20010750 

OCBBF20011010//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//1 .10E-133//405aa//55%// 
35 Q05481 

OCBBF20011240//TESTIS SPECIFIC PROTEIN A (ZINC FINGER PROTEIN TSGA).//1 .60E-81//391aa//42%// 
Q63679 

OCBBF20011400//VACUOLAR PROTEIN SORTING-ASSOCIATED PROTEIN VPS8,//3.30E-1 7//1 99aa//23%// 
P39702 

40 OCBBF20011760//MUS musculus actin-binding protein (ENC-1) mRNA, complete cds.//2.30E-236//527aa//75%// 
U65079 

OCBBF20012100//Cavia porcellus phosphatldic acid phosphatase 2a (PAP2a) mRNA, complete cds.//6.30E-29// 

285aa//29%//AF088283 

OCBBF20013070 

45 OGBBF2001 4020//MUS musculus NSD1 protein mRNA, complete cds.//0//886aa//73%//AF064553 
OCBBF2001 4080//H .sapiens mRNA for thioesterase 1 1 .//3.00E-67//1 63aa//83%//X86032 
OCBBF20014940//ublquitln-proteln ligase 1 [Arabldopsis thaliana]//2.00E-15//200aa//30%//AAF36454 
OCBBF20015270 

OCBBF20015280//MYOSIN HEAVY CHAIN, SMOOTH MUSCLE ISOFORM (SMMHC).//4.90E-23//415aa//22%// 
50 P35748 

OCBBF20015860//TRANSCRIPTION ELONGATION FACTOR S-ll (TFIIS).//7.70E-06//1 04aa//33%//P49373 
OCBBF20017060 

PANCR1 000021 0//PROTEIN DISULFIDE ISOMERASE PRECURSOR (PDI) (EC 5.3.4.1) (PROLYL 4- HYDROXYLA- 
SE BETA SUBUNIT) (CELLULAR THYROID HORMONE BINDING PROTEIN) (P55).//9.1 0E-31//231 aa//33%//P05307 
55 PANCR1 0001 850 
PEBLM1 0000290 

PEBLM10000340//RNA-BINDING PROTEIN EWS.//1 .80E-284//615aa//83%//Q01844 

PEBLM10000680//ACTIN, CYTOPLASMIC TYPE 5.//1 .30E-70//158aa//86%//P53505 PEBLM 10001 440//rrg//2.40E- 
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2 1 2//385aa//60%//GAA48220 
PEBLM1 0001 800 
PEBLM20000300 

PEBLM20001t20//Homo sapiens nucleotide-binding site protein 1 mRNA, compiete cds.//1.90E-127//899aa//34%// 

5 AF298547 

PEBLi\/l20001260 
PEBLi\/l20001470 

PEBLi\/l20002130//l\/lus muscuius genes for integrin ai\/l290, hapsin, partial and comp i ete cds.//8.20E-44//246aa// 
47%//AB036930 

10 PEBLiV120002480//ZiNC FiNGER PROTEiN 157.//8.70E-71//210aa//47%//P51786 PEBLIV120002700//Homo sapiens 
tissue-type bone marrow zinc finger protein 4 mRNA, compiete cds.//6.70E-237//537aa//80%//AF070651 
PEBLiVI20003080//ZiNC FiNGER PROTEIN 135.//2.40E-133//335aa//65%//P52742 
PEBLIVI20003950//ZINC FINGER PROTEIN 1 65.//3.20E-35//143aa//59%//P49910 

PEBLI\/I20004790//PROTO-ONCOGENE TYROSINE-PROTEIN KINASE FYN (EC 2.7.1.112) (P59-FYN).//4.70E- 
15 264//410aa//95%//P39688 

PLACE50000370//Homo sapiens mRNA for hVPSII , complete cds.//1 .80E-149//281aa//98%//AB027508 
PLAGE500005B0//PUTATIVE IMPORTIN BETA-4 SUBUNIT (KARYGPHERIN BETA-4 SUBUNIT).//9.60E-76//937aa// 
27%//060100 
PLAGE50000670 

20 PLAGE500006B0//C4B-BINDING PROTEIN PRECURSOR (C4BP).//5.1 0E-09//1 36aa//30%//P08607 

PLACE50000800//Human non-lens beta gamma-crystal lin iilce protein (AIMI) mRNA, partial cds.//0//840aa//99%// 
U83115 

PLACE50001050//iWus muscuius mRNA for heparan sulfate 6-su If otransf erase 2, complete cds.//6.10E-236//300aa// 
89%//AB024565 

25 PLACE50001130//GLUCOAMYLASE 81/82 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUC08IDA8E) 
(1 ,4-ALPHA-D-GLUCAN GLUCOHYDROLASE).//2.20E-23//729aa//24%//P08640 
PLAGE50001530 

PLACE50001700//DNAK PROTEIN (HEAT SHOCK PROTEIN 70) (HSP70).//3.00E-05//108aa//30%//Q56235 
PLACE60000440 
30 PLAGE60000700 
PLAGE60000800 
PLAGE60001370 

PLACE60002050//rRANSCRIPTIONAL REPRESSOR PROTEIN YY1 (YIN AND YANG 1) (YY-1) (DELTA TRAN- 
SCRIPTION FACTOR) (NF-E1).//6.00E-73//212aa//66%//P25490 
35 PLACE60002630 
PLAGE60003710 

PLACE60003790//PUTATIVE PRE-MRNA SPLICING FACTOR RNA HELICASE (DEAH BOX 
PROTEIN 15) (ATP-DEPENDENT RNA HELICASE #46).//5.30E-104//191aa//99%//043143 

PLACE60004240 
40 PLACE60004290 
PLAGE60005230 
PLAGE60005500 

PLACE60005550//Human (c-myb) gene, complete primary cds, and five complete alternatively spliced cds.//4.60E- 
20//71 aa//66%//U22376 
45 PLACE60009530 

PLACE60012810//Probable acyl-CoA synthetase (EC 6.2.1.-) - Mycobacterium tuberculosis (strain H37RV)//3.00E- 

1 1 //600aa//24%//C70669 

PLACE60012940 

PLACE60014430//Homo sapiens mRNA for M0CS1A & M0CS1B proteins, complete CDSs.//1 .60E-146//283aa// 

50 98%//AJ224328 

PLAGE60018860//ADENYLATE CYCLASE, TYPE IV (EC4.6.1.1) (ATP PYROPHOSPHATE-LYASE) (ADENYLYL CY- 
CLASE) .//1 .30E-244//504aa//90%//P26770 
PLACE60019230 
PLACE60019250 
55 PLACE60020160 

PLACE60020840//CYTOCHROME B561 (CYTOCHROME B-561)y/1 .00E-45//211aa//47%//Q95245 
PLACE60021020 

PLACE60021510//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//5.00E-140//351aa//61%//P51523 
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PLACE600241 90//TRICHOHYALIN.//1 .40E-09//299aa//24%//P37709 

PLACE600266B0//Homo sapiens Arg/Abl-interacting protein ArgBP2b (ArgBP2b) mRNA, partial cds.//1.00E-254// 

484aa//96%//AF049885 

PLACE60026920 

5 PLACE60026990//Human PI\/1S2 related (hPMSR6) mRNA, complete cds.//5.70E-42//112aa//75%//U38980 
PLACE60029490 

PLACE60030380//ZINC FINGER PROTEIN 84 (ZINC FINGER PROTEIN HPF2).//3.60E-34//335aa//30%//P51523 
PLACE60030940 
PLACE600310 
10 PLACE60032040 
PLACE60033720 

PLACE60033990//SPIDROIN 1 (DRAGLINE SILK FIBROIN 1) (FRAGMENT).//1 .40E-08//234aa//27%//P19837 
PLACE60037050 
PLACE60037400 
15 PLACE60037450 

PLACE60038500//Homo sapiens mitochondrial solute earner mRNA, complete cds.//5.10E-65//171aa//70%// 

AF1 55660 
PLACE60040050 
PLAGE60043120 
20 PLAGE60043360 
PLACE60043960 

PLACE60043970//Tal<ifugu rubripes retinitis pigmentosa GTPase regulator-like protein gene, partial cds.//2.60E-14// 
329aa//2 1 %//AF2B6475 

PLACE60044540//GLUCOAMYLASE S1/S2 PRECURSOR (EC 3.2.1.3) (GLUCAN 1 ,4-ALPHA-GLUCOSIDASE) 
25 (1 ,4-ALPHA-D-GLUCAN GLUC0HYDR0LASE).//1 .90E-46//865aa//26%//P08640 

PLACE60044640//Human placenta (Diff48) mRNA, complete cdsy/2.60E-88//414aa//45%//U49187 

PLACE60044910 

PLACE60046630 

PLACE60046870 
30 PLACE60047380 

PLACE60049310 

PLACE60049930 

PLACE60050290 

PR0ST1 0001 520 

35 PROST10001670//SYNAPSIN I (BRAIN PROTEIN 4.1).//2.20E-07//239aa//28%//P17600 
PR0ST1 0002200 
PR0ST1 0002460 

PROST10002720//Home sapiens mRNA for calsyntenin-2 (CS2 gene).//7.00E-23//153aa//42%//AJ278018 
PR0ST1 0003430//Numb-binding protein LNXp80//0//732aa//87%//T09457 

40 PR0ST1 0005260 

PROST10005360//Homo sapiens contactin associated protein (Caspr) mRNA, complete cds.//1.30E-152//719aa// 
38%//U 87223 
PR0ST1 0005640 
PROST20000360 

45 PROST20000530//60S RIBOSOMAL PROTEIN L13A (23 KDA HIGHLY BASIC PROTEIN).//2.00E-33//73aa//95%// 
P40429 

PROST20001760//RD PROTEIN (WL623).//7.30E-12//104aa//41%//P19426 

PROST20002060 

PROST20002670 

50 PROST20002730//H4(D1 0S1 70) PROTEIN.//2.60E-63//11 8aa//9B%//Q1 6204 

PROST20002740 

PROST20003250//Homo sapiens DAZ associated protein 1 (DAZAP1) mRNA, complete cds.//2.50E-150//264aa// 

98%//AF1B1719 
PROST20004630 
55 PROST20017390 
PROST20017960 

PROST20018230/n"RANSCRIPTION FACTOR SP1 .//4.60E-59//287aa//42%//Q01 714 
PROST20018990//Human Rar protein mRNA, complete cds.//1 .20E-127//278aa//88%//U05227 
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PROST20019980 
PROST20021620 

PROST20023380//Cca3 protein//5.20E-68//142aa//94%/A'31081 

PROST20025910 

5 PROST20026820//DENTIN MATRIX ACIDIC PHOSPHOPROTEIN 1 PRECURSOR (DENTIN MATRIX PROTEIN-1) 

(DM P-1 ) (AG 1 ) .//6. 90 E-05//255aa//27%//0551 88 
PROST20028420 

PROST20029600//Human butyrophilin (BTF1) mRNA, complete cds.//2.00E-61//144aa//87%//U90543 
PROST20031020 

10 PROST20031170//DNA BINDING PROTEIN URE-B1 (EC 6.3.2.-).//1 .70E-23//282aa//27%//P51593 
PROST20032100 

PROST20032320//BETA-GLUCURONIDASE PRECURSOR (EC 3.2.1.31) (BETA-G1).//1 .20E-18//46aa//91%// 
P08235 

PROST20033020 
15 PROST20033030 

PROST20033380//KINESIN LIGHT CHAIN (KLC).//7.70E-08//146aa//27%//P46824 

PROST20033400//SERINE/rHREONINE-PROTEIN KINASE 9 (EC 2.7.1 .-).//1.80E-25//55aa//100%//076039 
PROST20034720//IMMEDIATE-EARLYPROTEIN.//7.80E-11//180aa//24%//Q01042 
PROST20037320 
20 PROST20039220 

PROST20043320//Homo sapiens mRNA for BAP2-beta protein, complete cds.//3.70E-227//431aa//99%//AB015020 
PROST200441 60/n"ROPOMYOSIN 5, CYTOSKELETAL TYPE.//7.80E-4B//1 00aa//97%//P21 1 07 
PROST20044810 

PROST20051210//PROBABLE PROTEIN PHOSPHATASE 20 T23F11.1 (EC 3.1.3.16) (PP2C).//7.00E-16//74aa// 
25 41%//P49596 

PROST20051430 
PROST20054260 
PROST20056040 
PROST20058a00 

30 PROST20059190//Human breast cancer, estrogen regulated LIV-1 protein (LIV-1) mRNA, partial cds, //I. 50E-28// 
109aa//55%//U41060 
PROST20059430 
PROST20061960 

PROST20062600//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 
35 2.20E-07//269aa//22%//Q08170 

PR0ST20064500//N-HYDROXYARYLAMINE SULFOTRANSFERASE (EC 2.8.2.-) (HAST-l).//9.10E-38//102aa// 

70%//P50237 

PROST20067370//H .sapiens mRNA for XIAP associated factor-1 .//2.40E-122//223aa//99%//X99699 
PROST20069880//MUS musculus DXImx48e protein (DXImx48e) mRNA, complete cds.//9.6e-316//750aa//79%// 

40 AF229644 

PROST20072370 

PROST20072890/n-UMOR NECROSIS FACTOR, ALPHA-INDUCED PROTEIN 1 , ENDOTHELIAL (B12 PROTEIN).// 

1 .50 E-11 7//3 1 4aa//70%//Q 1 3829 

PROST20073170//Rattus norvegicus zinc finger protein RIN ZF mRNA, complete cds.//1 .80E-268//559aa//87%// 
45 AF091457 PROST20073890//VASCULAR ENDOTHELIAL GROWTH FACTOR PRECURSOR (VEGF) (VASCULAR 
PERMEABILITY FACTOR) (VPF).//5.30E-33//63aa//98%//P15692 

PROST20079740//ANTER-SPECIFIC PROLINE-RICH PROTEIN APG (PROTEIN CEX) (FRAGMENT) .//4.10E-09// 
97aa//36%//P40603 

PROST200B5160//rROPOMYOSIN, CYTOSKELETAL TYPE (TM30-NM).//2.10E-93//220aa//87%//P12324 
50 PROST20094830 
PUAEN1 0000570 
PUAEN1 0000810 

PUAEN1 0001 61 0//GENERAL NEGATIVE REGULATOR OFTRANSCRIPTION SUBUNIT 1 .//2.1 0E-68//448aa//32%// 
P25655 
55 PUAEN 10003220 
SALGL1 0000050 

SALGL1 0000470//NG36 [Homo sappiens]//3.00E-53//1 84aa//96%//AAD21 81 1 

SALGL1 0000650//POLYHOMEOTIC-PROXIMAL CHROMATIN PROTEIN.//4.60E-08//71 aa//43%//P39769 



194 



EP 1 293 569 A2 



SALGL10001570//APOLIPOPROTEIN L PRECURSOR (APO-L),//5.10E-99//338aa//61%//014791 
SKMUS10000140//Polyubiquitin 9 - human.//2.10E-199//2B0aa//96%//M26880 

SKMUS10000220//NUCLEAR PORE PROTEIN SEH1 HOMOLOG.//2.00E-58//346aa//39%//Q10099 
SKMUS10000640//MUS musculus RING-finger protein MURF mRNA, complete cds.//4.10E-111//348aa//60%// 
5 AF294790 

SKMUS10001040//Homo sapiens mRNA for HEXiMI protein, complete cds.//2.40E-49//256aa//47%//AB0211 79 
SKIVIUSI 00011 BO/'/Homo sapiens t(3;5)(q25.1 ;p34) fusion gene NPIVI-MLFI mRNA, complete cds.//3.60E-126// 
258aa//94%//L49054 
SKIVIUS10001240 

10 SKIVIUS10001290/ISOPENTENYL-DiPHOSPHATE DELTA-ISOMERASE (EC 5.3.3.2) (IPP ISOMERASE) (ISOPEN- 
TENYL PYROPHOSPHATE ISOMERASE).//4.10E-81//227aa//64%//Q13907 

SKMUS10001770//PROTEIN-L-ISOASPARTATE 0-METHYLTRANSFERASE (EC 2.1.1.77) (PROTEIN- BETA-AS- 
PARTATE METHYLTRANSFERASE) (PIMT) (PROTEIN L-ISOASPARTYL 
METHYLTRANSFERASE) (L-ISOASPARTYL PROTEIN CARBOXYL 
15 IVIETHYLTRAN S FE RAS E) .//2 .50 E-1 8//2 1 3aa//29%//0269 1 5 

SKI\/lUS20000740//Homo sapiens methyltransferase C0Q3 (C0Q3) mRNA, complete cds.//8.50E-166//309aa//99%// 
AF193016 

SKIV1US20001170//Homo sapiens IVIAGEFI (MAGEF1) mRNA, complete cdsy/1 .50E-74//305aa//50%//AF295378 

SKI\/IUS20002710 
20 SKIV1US20003430 

SKMUS20003650//Human (p23) mRNA, complete cds.//7.00E-20//1 1 0aa//39%//L24B04 SKMUS20003900//Homo sa- 
piens 38 l<Da Mov34 homolog mRNA, complete cds.//1 .20E-152//286aa//99%//U70734 
SKMUS20004580//MUS musculus N-RAP mRNA, complete cds.//4.50E-160//591aa//56%//U76618 
SKMUS20004670 
25 SKMUS20004680 

SKMUS20007240//Homo sapiens mRNA for 2-hydroxyphytanoyl-CoA lyase.//2.60E-1 48//31 8aa//88%//AJ1 31 753 
SKMUS20007740//BALBIANI RING PROTEIN 1 (GIANT SECRETORY PROTEIN l-A) (GSP-IA) (FRAGIVIENT).// 

1 .30E-08//1 38aa//26%//P02849 
SKI\/IUS20008470 

30 SKIV1US20008630//PROBABLE ASPARAGINYL-TRNA SYNTHETASE (EC 6.1.1 .22) (ASPARAGINE-TRNA LIGASE) 
(ASN RS).//1 . 1 0E-1 03//445aa//46%//P52276 

SKI\yiUS20009020//BR01 PROTEIN.//2.30E-08//232aa//26%//P48582 

SKMUS20009330//RNA polymerase III subunit [Homo sapiens]//1 .80E-44//216aa//47%//U93868 
SKMUS20009450 

35 SKMUS20009540//Homo sapiens F-box protein Fbx25 (FBX25) mRNA, partial cds.//4.20E-93//263aa//64%// 
AF1 74605 

SKMUS2001 0080//MUS musculus mRNA for a skeletal muscle and cardiac protein.//1 .00E-75//1 78aa//87%//AJ01 1118 
SKMUS20011 290//NAD-DEPENDENT METHANOL DEHYDROGENASE (EC1, 1,1, 244) (MEDH),//3,70E-45//1 95aa// 
32%//P31 005 

40 SKMUS2001 1 470//MUS musculus RP42 mRNA, complete cds.//1 .30E-32//1 86aa//36%//AF1 98092 
SKMUS20013640 

SKMUS20014920//Zinc finger protein//4.40E-05//153aa//24%//T37771 
SKMUS20015010 

SKMUS20015430//Homo sapiens HDCMC29P mRNA, partial cds.//3.50E-128//248aa//97%//AF068295 
45 SKMUS20016080 
SKMUS20016310 

SKiy/IUS20016340//HIGH MOBILITY GROUP PROTEIN HMG2 (HMG-2).//6.00E-11//170aa//25%//P26583 

SKMUS20016620//Oryctolagus cuniculus CARP mRNA, complete cds.//2.70E-43//196aa//51%//AF131883 

SKMUS20016680//Neuron-speciflc signal trunduction protein Stac//8.30E-33//218aa//38%//NP_058549 
50 SKMUS20016710 

SKNMC1 0000070 

SKNMC1 00001 00 

SKNMC1 00001 90 

SKNMC1 0000290 
55 SKNMC10001100 

SKNMCI 0001 590 

SKNIVIC1 0001 680 

SKNMCI 0002290 
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SKNMC1 000251 0//Homo sapiens MT-ABC transporter (MTABC) mRNA, complete cds.//0//672aa//93%//AF076775 
SKNMC1 0002640 

SKNMC20000650/yZINC FINGER PROTEIN 136.//7.20E-05//311aa//23%//P52737 
SKNMC20000970//M.musculus mRNA for protein Htf9C.//9.80E-220//552aa//75%//X56044 
5 SKNMG20002240//ZINC FINGER PROTEIN 228.//1 .80E-68//226aa//53%//Q9UJU3 
SKNMC20003050 

SKNMC20003220//MAJOR CENTROMERE AUTOANTIGEN B (CENTROMERE PROTEIN B) (CENP-B)7/3.70E-1 0// 
153aa//32%//P07199 

SKNMC20003560//MUS musculus IVIax-interacting transcriptional repressor (MadS) mRNA, complete cds.//1 .90E-72// 
10 168aa//86%//U32394 
SKNMC20005930 
SKNMC20006120 
SKNMC20010570 

SKNMC20011130//Rattus norveglcus golgi peripheral membrane protein p65 (GRASP65) mRNA, complete cds.// 

15 6.90E-90//244aa//66%//AF01 5264 

SKNMC20015030//P-SELECTIN GLYCOPROTEIN LIGAND 1 PRECURSOR (PSGL-1) (SELECTIN P LIGAND).// 

3.60E-1 1 //1 52aa//32%//Q621 70 
SKNMC20015550 

SKNMC20Q1 5960//Homo sapiens mRNA for ANKHZN, complete cds.//0//1 046aa//95%//AB037360 
20 SKNSH1 0000860 

SKNSH10001740//ORNITHINE DECARBOXYLASE (EC4.1.1.17) (ODC).//9.00E-102//352aa//53%//P00860 
SKNSH1 000301 0//Homo sapiens DRC3 mRNA, complete cds.//3.00E-154//305aa//91%//AF282167 
SKNSH1 0003080 
SKNSH20001510 
25 SKNSH20001630 

SKNSH20003470//CYTOCHROME B2 PRECURSOR (EC 1.1.2.3) (L-LACTATE DEHYDROGENASE (CYTO- 
CHROME)) (L-LACTATE FERRICYTOCHROME C OXIDOREDUCTASE) (L-LCR).//2.60E-07//107aa//32%//P00175 
SMINT1 00001 60//2-HYDROXYACYLSPHINGOSINE 1-BETA-GALACTOSYLTRANSFERASE 
PRECURSOR (EC 2.4.1 .45) (UDP-GALACTOSE-CERAMIDE GALACTOSYLTRANSFERASE) 
30 (CERAMIDE UDP-GALACTOSYLTRANSFERASE) (CEREBROSIDE SYNTHASE).//3.80E-71//492aa//33%//064676 
SMINT1 0000390 

SMINT1 0000420//ATP-BINDING CASSETTE, SUB-FAMILY A, MEMBER 3 (ATP-BINDING 

CASSETTE TRANSPORTER 3) (ATP-BINDING CASSETTE 3) (ABC-C TRANSPORTER).//2.70E-92//662aa//34%// 
099758 
35 SMINT1 0000540 

SMINT1 0000570//Homo sapiens leucocyte immunoglobulin-like receptor-8 (LIR-8) mRNA, complete cds.//5.30E-21 2// 

481aa//84%//AF025534 
SMINT1 0000710 

SMINT10001000//PAIRED MESODERM HOMEOBOX PROTEIN 2B (PAIRED-LIKE HOMEOBOX 2B) (PH0X2B 
40 HOMEODOMAIN PROTEIN) (NEUROBLASTOMA PHOX) (NBPH0X).//1 .60E-05//87aa//39%//Q99453 

SMINT1 0001 030//Homo sapiens ankyrin repeat-containing protein ASB-2 mRNA, complete cds.//1 .20E-292//546aa// 

99%//AF159164 

SMINT10001180 

SMINT20000180 
45 SMINT20000400 

SMINT20001450//Halocynthia roretzi mRNA for HrPET-3, complete cds.//2.30E-20//125aa//40%//AB029335 

SMINT20002270 

SMINT20002390 

SMINT20002770//BUTYROPHILIN PRECURSOR (BT).//3.20E-51//269aa//41%//P1 8892 
50 SMINT20003960//A kinase anchor protein AKAP-KL isoform 2 //5.00E-254y/738aa//70%//T09226 
SMINT20004000//Homo sapiens FRG1 mRNA, complete cds.//7.00E-52//116aa//90%//L761 59 
SMINT20005450//MUS musculus Zfp228 (Znf228) mRNA, complete cds.//1 .20E-31//125aa//49%//AF282919 

SMINT20005580 
SPLEN1 0000490 

55 SPLEN1 000091 0//Homo sapiens HRIHFB2007 mRNA, partial cds,//5.20E-95//1 99aa//90%//AB01 5330 

SPLEN10001430//HIGH MOBILITY GROUP PROTEIN HMG1 (HMG-1).//3.00E-78//147aa//100%//P09429 
SPLEN20000200//Human (c-myb) gene, complete primary cds, and five complete alternatively spliced cds.//3.10E- 
1 1 //64aa//57%//U 22376 
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SPLEN20000470 

SPLEN20000720//ZINC FINGER PROTEIN CKR1 .//3.10E-37//235aa//37%//P30373 

SPLEN20001340//CARBOXYPEPTIDASE S PRECURSOR (EC3,4. 1 7.4) (YSGS) (GLY-X CARBOXYPEPTIDASE).// 

3.30E-29//250aa//37%//P27614 
5 SPLEN20001970//SPLICING FACTOR, ARGININE/SERINE-RICH 4 (PRE-MRNA SPLICING FACTOR SRP75).// 

2.70E-1 4//243aa//28%//Q081 70 

SPLEN20002420 

SPLEN20002430 

SPLEN20002670//Rattus norvegicus TGF-beta resistance-associated protein (TRAG) mRNA, complete cds.//0// 
10 559aa//84%//AF305813 
SPLEN20002700 
SPLEN20003100 

SPLEN20003570//MUS musculus RalGDS-like protein 3 mRNA, complete cds.//8.30E-191//453aa//81%//AF237669 

SPLEN20004430 
15 SPLEN20004960 

SPLEN20005410 

ST0MA1 0000470 

ST0MA1 0000520 

ST0MA1 0001 170 
20 ST0MA1 0001 330 

STOMA10001860//CYTOSOLIC ACYL COENZYME A THIOESTER HYDROLA (EC 3.1 .2.2) 

(LONG CHAIN ACYL-COA THIOESTER HYDROLASE) (CTE-ll).//2.70E-173//328aa//99%//000154 

STOMA20000320 

STOMA20000880//IG LAMBDA CHAIN C REGI0NS.//1 .50E-51//105aa//96%//P01842 
25 STOMA20001 21 0//Fugu rubripes CCBL1 gene, exons 1 to 1 2.//1 .90E-1 27//41 5aa//55%/A'1 7462 
STOMA20001880 
STOMA20002570 

STOMA20002890 

STOMA20003960//LIM-ONLY PROTEIN 6 (TRIPLE LIM DOMAIN PROTEIN 6).//3.60E-59//352aa//39%//043900 
30 STOMA20004780 

STOMA20004820//1-phosphatldyllnosltol-4,5-bisphosphate phosphodiesterase (EG 3. 1. 4. 11) delta-2//6.90E-148// 
325aa//84%/S14113 

SYNOV10001280/'/Homo sapiens colon cancer-associated protein Mid (MIC1) mRNA, complete cds.//1.1e-316// 
609aa//97%//AF1 43536 
35 SYN0V1 0001 640 
SYNOV20001770 

SYNOV20002910//Arabinogalactan-likeprotein//2.90E-07//124aa//31%//S52994 

SYNOV20008200//Trichoplusla nl transposon IFP2.//4.1 0E-13//254aa//27%//J04364 SYNOV20010140//Mus muscu- 
lus Zfp228 (Znf228) mRNA, complete cds.//2.50E-31//125aa//49%//AF282919 
40 SYNOV20011440 

SYNOV20013740//ZINC FINGER PROTEIN 134.//1 .80E-108//332aa//53%//P52741 

SYNOV20014510//MYOCYTE-SPECIFIC ENHANCER FACTOR 2B (SERUM RESPONSE FACTOR-LIKE PROTEIN 

2) (XMEF2) (RSRFR2).//6.40E-140//302aa//88%//Q02080 

SYNOV20014570 

45 SYNOV20016480//rHYMIDINE PHOSPHORYLASE PRECURSOR (EC 2,4,2,4) (TDRPASE) (TP) (PLATELET-DE- 
RIVED ENDOTHELIAL CELL GROWTH FACTOR) (PD-ECGF) (GLIOSTATIN).//2.80E-35//69aa//100%//P 19971 
TEST1 10000230 

TESTI10000250//M .musculus mRNA for testls-speclfic protein, DDC8.//2.60E-68//462aa//42%/A'09878 
TESTI10000420//PARAMYOSIN (ANTIGEN SJ97).//2.50E-08//367aa//23%//Q05870 
50 TESTI1 000051 0//CYTADHERENCE HIGH MOLECULAR WEIGHT PROTEIN 2 (CYTADHERENCE ACCESSORY 
PROTEIN 2).//4.00E-1 3//648aa//22%//P75471 

TESTI10000550//HOMEOBOX PROTEIN SIX1 (FRAGMENT).//8.90E-11//1 99aa//30%//Q62231 

TEST1 1 0000640//Fugu rubripes sex comb on midleg-like 2 protein (SCML2) gene, complete cds.//2.20E-1 40//51 3aa// 

49%//AF 146688 

55 TESTI10000700//Rattus norvegicus deubiquitinating enzyme Ubp69 (ubp69) mRNA, complete cds.//7.00E-297// 
61 8aa//88%//AF1 06659 
TEST1 10000960 
TEST1 10001 250 
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TESTI10001270//POLYCYSTIN PRECURSOR (AUTOSOMAL DOMINANT POLYCYSTIC KIDNEY DISEASE PRO- 
TEIN 1 ).//3.60E-1 61 aa//33%//P981 61 

TEST1 10001 31 0//Homo sapiens TCP11 (TCP11) mRNA, complete cds.//1 .90E-223//425aa//100%//AF269223 
TESTI10001380//NEUROENDOCRINE CONVERTASE 3 PRECURSOR (EC 3.4.21 .61) (NEC 3) 
5 (PC4) (PROHORMONE CONVERTASE 3) (KEX2-LIKE ENDOPROTEASE 3).//1 .10E-140//303aa//85%//P29121 
TEST1 10001 630 

TESTI10001680//PROTEIN PHOSPHATASES PP1 REGULATORY SUBUNIT SDS22.//'4.30E-14//158aa//35%// 
P36047 

TESTI1 0001 790 

10 TESTI10001910//Homo sapiens 88-kDa Golgl protein (GM88) mRNA, complete cds.//8.70E-77//274aa//59%// 
AF204231 
TESTI20000180 

TESTI20000440//TRiCHOHYALIN.//3.00E-16//476aa//26%//P22793 

TESTi20001200//Homo sapiens mRNA for zinc finger 3 (ZF3 gene).//6.90E-13//108aa//40%//X60153 
15 TESTI20001540//PUTATIVE SERINE/THREONINE-PROTEIN KINASE D1 044.3 IN CHROMOSOME III (EC 2.7.1 .-).// 
1 .30E-32//1 03aa//48%//P41 951 

TESTI20001770//INTER-ALPHA-TRYPSIN INHIBITOR HEAVY CHAIN H3 PRECURSOR (ITI 

HEAVY CHAIN H3) (SERUM-DERIVED HYALURONAN-ASSOCIATED PROTEIN) (SHAP).//5.10E-05//157aa//22%// 

Q06033 
20 TESTI20001790 

TESTI20001 840//SIT4-ASSOCIATING PROTEIN SAP1 85.//7.90E-0B//1 09aa//33%//P40856 
TESTI20002070//NIFU-LIKE PROTEIN.//3.60E-43//105aa//80%//Q9ZD61 

TESTI20002080//Homo sapiens mRNA for Gab2, complete cds.//7.60E-62//222aa//60%//AB01 8413 
TESTI20002380 

25 TESTI20002530//Homo sapiens A1 U mRNA, complete cds.//6.60E-1 7//220aa//31 %//AF1 88240 
TESTI20003560//TUBULIN ALPHA-3/ALPHA-7 CHAIN.//2.00E-40//1 1 9aa//73%//P0521 4 
TESTI20003720 

TESTI20004350//CALDESMON (CDM).//1 .20E-09//180aa//23%//P12957 
TESTI20004620 
30 TESTI20005200 

TESTI20005910//ADENYLATE KINASE, CHLOROPLAST (EC 2.7.4,3) (ATP-AMP TRANSPHOSPHORYLASE).// 
3.60E-34//209aa//37%//P431 88 

TESTI20006000//RESTIN (CYTOPLASMIC LINKER PROTEIN-170 ALPHA-2) (CLIP-170) 
(REED- STERNBERG INTERMEDIATE FILAMENT ASSOCIATED PROTEIN).//6.80E-21//196aa//32%//P30622 
35 TESTI2O006270 
TESTI20006710 

TESTI20006950//KINESIN HEAVY CHAIN.//7.80E-07//391aa//22%//P21613 

TESTI20006990//KINESIN-LIKE PROTEIN KIF2 (KINESIN-2) (HK2).//2.00E-1 B4//539aa//63%//000139 
TESTI20007070//DOUBLESEX PROTEIN, MALE-SPECIFIC.//3.00E-13//163aa//31%//P23023 
40 TESTI20007620//DRA PROTEIN (DOWN-REGULATED IN ADEN0MA).//1 .60E-09//88aa//38%//P40879 
TESTI20007840//PUTATIVE IMPORTIN BETA-4 SUBUNIT (KARYOPHERIN BETA-4 
SUBUN IT).//6.00E-89//1 092aa//26%//0601 00 
TESTI20008190 
TESTI20008300 

45 TESTI20008490//MYOSIN HEAVY CHAIN, CLONE 203 (FRAGMENT).//4.90E-09//331 aa//24%//P39922 

TESTI20008830//MYOSIN-BINDING PROTEIN C, SLOW-TYPE (SLOW MYBP-C) (C-PROTEIN, SKELETAL MUS- 
CLE SLOW-ISOFORM).//4.20E-88//1 62aa//1 00%//Q00872 
TESTI20009090 
TESTI20009510 

50 TESTI20009700 
TESTI20010080 

TESTI20010490//HYPOTHETICAL ZINC FINGER PROTEIN KIAA0961.//2.70E-155//504aa//56%//Q9Y2G7 

TESTI20010820 
TESTI20011340 

55 TESTI20011410//Rattus norveglcus actin-filament binding protein Frabin mRNA, complete cds.//0//766aa//83%// 
AF038388 

TESTI2001 1 B00//TRICHOXYALIN.//5.80E-07//322aa//22%//Q07283 

TESTI20012370//RING CANAL PROTEIN (KELCH PR0TEIN).//1 .60E-38//438aa;/26%//Q04652 
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TESTI20012690//DIHYDROLIPOAMIDE ACETYLTRANSFERASE COMPONENT OF PYRUVATE 
DEHYDROGENASE COMPLEX, MITOCHONDRIAL PRECURSOR (EC 2.3.1.12) (E2) (PDC-E2) (70 KDA MITO- 
CHONDRIAL AUTOANTIGEN OF PRIMARY BILIARY CIRRHOSIS) (PBC) (M2 
ANTIGEN COMPLEX 70 KDA SUBUNIT).//3.40E-308//575aa//99%//P10515 
5 TESTI20013060 

TESTI20013300//Homo sapiens NY-REN-60 antigen mRNA, partial cds.//1.10E-172//315aa//99%//AF155116 
TESTi20013450//M.muscuiusTenrmRNAfor RNA binding protein.//3.00E-273//'576aa//'88%//X84693 
TESTI20013520 

TESTI20014120//TRICHOHYALiN.//1.40E-28//370aa//27%//P37709 
10 TESTI20014200//D.melanogaster mRNA for putative organic cation transporter, 
2064 bp.//4.20E-54//357aa//33%//Y1 2400 

TESTI20015110//MYOSIN II HEAVY CHAIN, NON MUSCLE.//3.50E-07//255aa//24%//P08799 
TESTI20015120//TOM1 (target of nnyb 1)//1 .00E-57//245aa//56%//NP_005479 

TESTI20015560//ZING FINGER PROTEIN 151 (MYC-INTERACTING ZINC FINGER PROTEIN) (MIZ-1 PROTEIN).// 
15 9.50E-16//278aa//28%//Q13105 
TESTI20015930 
TESTi20016210 

TESTi2001 661 0//DYNEIN BETA CHAIN, FLAGELLAR OUTER ARM.//1 .1 OE-1 7//432aa//25%//Q39565 
TESTi20016650//IMMEDIATE-EARLYPROTEIN.//9.50E-06//111aa//28%//Q01042 
20 TESTI20016710 
TESTI20017580 
TESTI20017660 
TESTI20017920 

TESTI20018150//GASTRULAZINC FINGER PROTEIN XLCGF7.1 (FRAGMENT).//7.20E-13//98aa//37%//P18735 
25 TESTI20018260 

TESTI20018270//TRANSKETOLASE (EC 2.2.1.1) (TK) (P68).//6.90E-230//614aa//67%//P40142 
TESTI20018290 

TESTI2001 8520//Homo sapiens contactin associated protein (Caspr) mRNA, complete cds.//5.40E-1 67//724aa//40%// 
U87223 
30 TESTI20018620 

TESTI2001 8690//Xenopus laevis bIcaudal-C (BIc-C) mRNA, complete cds.//1 .20E-08//189aa//30%//AF224746 
TESTI20018790//ZINC FINGER PROTEIN 157.//8.70E-104//443aa//45%//P51786 
TESTI20018980 
TESTI20019500 
35 TESTI20019680 
TESTI20019910 
TESTI20020020 
TESTi20020480 

TESTi20020570//Human actin-lil<e peptide mRNA, partial cds.//6.10E-140//307aa//8B%//U20582 
40 TESTi20020810//HYPOTHETICAL 80.0 KDA PROTEIN IN P0L1-RAS2 INTERGENIC REGiON,//5,30E-40//235aa// 
35%//P50944 
TESTi20020900 

TESTI20021050//MICRONUCLEAR LINKER HISTONE POLYPROTEIN (MIC LH) [CONTAINS: LINKER HISTONE 
PROTEINS ALPHA, BETA, DELTA AND GAMMA].//3.90E-11//365aa//24%//P40631 
45 TESTI20021490//ZINC FINGER PROTEIN 131 (FRAGMENT).//5.30E-191//347aa//99%//P52739 
TESTI20022230//Chlamydomonas relnhardtii strain 1132D- flagellar 

protofllament ribbon protein (RIB43a) mRNA, complete cds.//7.70E-12//137aa//31%//AF1 96576 

TESTI20022450 

TESTI20022510 

50 TESTi20022560//GUANYLATE KINASE (EC 2.7.4.8) (GMP KINASE).//2. 20E-20//1 88aa//31 %//Q64520 

TESTi20022640 

TEST120022940//MOB2 PROTEIN (MPS1 BINDER 2).//4.80E-1 6//133aa//31%//P43563 
TESTi20023610 
TESTi20023690 
55 TESTi20024150 

TESTi20024230//PiSTIL-SPECIFIC EXTENSIN-LIKE PROTEIN PRECURSOR 
(PELP).//8.40E-1 5//1 55aa//34%//Q0321 1 

TESTI2002461 0//TRICHOHYALIN.//3.60E-1 3//423aa//23%//P37709 
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TESTI20024650//FIBROIN HEAVY CHAIN PRECURSOR (FIB-H) (FRAGMENTS).//4.70E-06//142aa//33%//P05790 
TESTI20024670 

TESTI20024980//Danio rerio p55-related MAGUK protein DLG3 (dIgS) mRNA, complete cds.//2.00E-221//532aa// 
75%//AF 124435 

5 TESTI20025160//MELANOMA-ASSOCIATED ANTIGEN B1 (MAGE-B1 ANTIGEN) (MAGE-XP ANTIGEN) (DAM10). 
//1 .40E-89//349aa//54%//P43366 

TESTI20025440 
TESTI20025800 
TESTI20026320 

10 TESTI20026760//ZINC-BINDING PROTEIN A33.//1 .60E-38//235aa//37%//Q02084 
TESTI20026980 
TESTI20027000 

TESTI20027070//PLASMA-CELL MEMBRANE GLYCOPROTEIN PC-1 [INCLUDES: ALKALINE 
PHOSPHODIESTERASE I (EC 3.1.4.1); NUCLEOTIDE PYROPHOSPHATASE (EC 3.6.1.9) (NPPASE)].//1 .50E-50// 

15 406aa//33%//P22413 

TESTI20027290//Homo sapiens mRNAforoligoplirenin 1.//2.50E-56//393aa//38%//AJ001189 
TESTi20027890//ZINC FINGER PROTEIN 33A (ZINC FINGER PROTEIN K0X31) (KIAA0065) (HA0946) (FRAG- 
MENT) .//9.40E-91//472aa//42%//006730 
TESTi2002a060 

20 TESTi20028400 
TESTI20028660 

TESTI20029120//DUAL-SPECIFICITY TYROSINE-(Y)-PHOSPHORYLATION REGULATED KINASE (EC 2.7.1.-) 

(PROTEIN KINASE MINIBRAIN HOMOLOG) (HP86).//1 .90E-47//253aa//41%//Q13627 

TESTI20029650 

25 TESTI20030050//MUS musculus taube nuss mRNA, complete cds.//1 .70E-119//260aa//91%//AF222802 
TESTI20030370 

TESTi20030590//TESTIS-SPECIFIC PROTEIN PBS13.//1 .70E-33//117aa//61%//Q01755 

TESTi20030710//Homo sapiens C2H2 (Kruppei-type) zinc finger protein mRNA, 

complete cds.//9,80E-18//139aa//46%//AF1 59567 
30 TESTI20030740//TRICHOHYALIN.//8.30E-12//368aa//24%//P22793 

TESTI20031090/A/ACUOLAR PROTEIN 8.//3.80E-23//367aa//27%//P39968 

TESTI20031170//rektln A1 [Strongylocentrotus purpuratus]//3.40E-91//397aa//45%//M97188 

TESTI20031300 

TESTI20031520 
35 TESTI20031930 

TESTI20031960 

TESTI20032280 

TESTi20032550 

TESTi20032800 
40 TESTi20032990 

TESTi20033250//SALIVARY PROLINE-RICH PROTEIN PRECURSOR (CLONE CP7) [CONTAINS: BASIC PEPTIDE 
P-F] (FRAGMENT).//4.50E-05//138aa//28%//P02812 

TESTi20033270//Testis-specific protein [Homo sapiens]y/1 .00E-22//120aa//47%//NP_067063 TESTI20033540//TRI- 
CHOH YALI N .//1 .80E-1 3//443aa//22%//P37709 
45 TESTI20033560 
TESTI20033760 

TESTI200341 30//Zinc finger protein 1 06//9.30E-263//781 aa//66%//T1 4273 
TESTI20034180 

TESTI200341 90//Homo sapiens very long-chain acyl-CoA synthetase (BG1) mRNA, complete cds.//3.00E-131// 
50 468aa//53%//AF1 79481 

TESTi20034980//TRIPLE FUNCTIONAL DOMAIN PR0TEIN//1 .00E-77//250aa//55%//075962 
TESTi20035120//SYNAPTONEMAL COMPLEX PROTEIN 1 (SCP-1 PROTEIN).//9.30E-06//459aa//20%//062209 

TESTi20035410 

TESTi20035510//Proiiferating-ceil nucleolar antigen P120-like protein - Archaeogiobus fuigidus.//3.00E-12//200aa// 
55 357o//F69504 

TESTi20035740//A-KINASE ANCHOR PROTEIN 150 (AKAP 150) (CAMP-DEPENDENT PROTEIN KINASE REGU- 
LATORY SUBUNIT II HIGH AFFINITY BINDING PROTEIN) (PI 50) (FRAGMENT).//4.50E-09//357aa//21%//P24587 
TESTI20035800 
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TESTI20035890 

TESTI20036250//IMMEDIATE-EARLY PR0TEIN.//1 .60E-07//120aa//35%//Q01 042 
TESTI20036490//GLYCOPROTEIN X PRECURSOR.//? .40E-06//1 07aa//31%//P28968 

TESTI20037270 
5 TESTI20037810 
TESTI20038940 

TESTI200391 40//Tektin A1//2.1 0E-66//41 0aa//36%//A461 70 

TESTI20039980//T-CELL RECEPTOR BETA CHAIN ANA 11 .//4.40E-13//123aa//40%//P06333 
TESTI20040000//Rattus norvegicus cca2 mRNA, complete cds.//5.60E-82//179aa//B6%//AB000199 
10 TESTI20040310 
TESTI20041110 

TESTI20041220//Babesla blgemlna200 kDa antigen p200 mRNA, partial cds.//4.60E-05//481aa//19%//AF1 42406 
TESTI20042070//Columba llvia mRNA for 5'-nucleotldase.//8.20E-113//31 7aa//67%//AJ131243 
TESTI20042290//IVIHC CLASS II TRANSACTIVATOR GIITA.//6.30E-05//89aa//43%//P79621 
15 TESTI20042430 

TESTI20042870//X INACTIVE SPECIFIC TRANSCRIPT PROTEIN (FRAGMENT).//2.40E-06//155aa//32%//P27571 
TESTI20042950//AMINOPEPTIDASE B (EC 3.4.11.6) (ARGINYL AMINOPEPTIDASE) 

(ARGININE AMINOPEPTIDASE) (CYTOSOL AMINOPEPTIDASE IV) (AP-B).//4.40E-19//141aa//33%//009175 
TESTI20047120//POTENTIAL PHOSPHOLIPID-TRANSPORTING ATPASE IK (EC 3.6.1.-) (FRAGMENT).//1 . OOE- 
20 86//169aa//98%//060423 
TESTI20049290 

TESTI20049820//CGMP-DEPENDENT PROTEIN KINASE 1 , ALPHA ISOZYME (EC 2.7.1 .37) (CGK 1 ALPHA) (CG- 
KI-ALPHA).//6.60E-07//1 87aa//26%//Q1 3976 
TESTI20049940 
25 TESTI20051550 

TESTI20052680//Rattus norveglcus RSD-6 mRNA, complete cdsy/4.20E-61//261aa//55%//AF271155 
TESTI20053960//ZINC FINGER PROTEIN 132.//0//589aa//99%//P52740 

TESTI20054080//SER/THR-RICH PROTEIN T10 IN DGCR REGION.//4,90E-11 7//263aa//82%//P54797 
TESTI20054920 

30 TESTI20055840//Homo sapiens Tandem PH Domain Containing Proteln-1 mRNA, complete cds.//2.70E-1 62//331 aa// 

92%//AF286160 

TESTI20056900//ENVELOPE GLYCOPROTEIN GP340 (MEMBRANE ANTIGEN) (MA) [CONTAINS: GLYCOPRO- 
TEIN GP220].//1 .50E-06//171aa//32%//P03200 

TESTI20057310//CHROMOSOME SCAFFOLD PROTEIN SCII.//1 .90E-05//297aa//20%//Q90988 
35 TESTI20057420//ENDOZEPINE-RELATED PROTEIN PRECURSOR (MEMBRANE-ASSOCIATED 
DIAZEPAM BINDING INHIBITOR) (MA-DBI).//5.20E-225//501aa//B3%//P07106 
TESTI20058600//MYELOID UPREGULATED PROTEIN.//2.20E-68//1 67aa//74%//035682 

TESTI20062380 
TESTI20062550 
40 TESTI20064250 

TESTI20064830//Homo sapiens RAN binding protein 16 mRNA, complete cds.//5.00E-163//414aa//64%//AF064729 

TESTI20065720//PROTEIN D52 (N8 PR0TEIN)y/1 .00E-22//129aa//46%//P55327 

TESTI20067740 

TESTI20068660//Homo sapiens infertility-related sperm protein mRNA, complete cds.//5.90E-197//365aa//99%// 
45 AF311312 

TESTI20068720 
TESTI20069780 
TESTI20069790 

TESTI20071 830//Homo sapiens transcriptional intermediary factor 1 gamma mRNA, complete cds.//1.40E-125// 
50 233aa//99%//AF1 19043 
TESTI20073580 
TESTI20074020 

TESTI20074640//ZINC FINGER PROTEIN 85 (ZINC FINGER PROTEIN HPF4) (HTF1).//3.20E-119//428aa//45%// 
Q03923 

55 TESTI20074660//ZING FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7).//1 .30E-194//509aa//68%// 

Q05481 

TESTI20074800 
TESTI20076130 
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TESTI20077490 
TESTI20077500 

TESTI20078140//D7 PROTEIN.//1 .10E-25//99aa//49%//P13007 

TESTI20078640//Homo sapiens zinc finger protein ZNF232, exon 4 and complete cds.//2.60E-61//11 9aa//99%// 
5 AF080171 

TESTI20078670//RING CANAL PROTEIN (KELCH PROTEIN).//8.90E-09//269aa//20%//Q04652 
TESTI20078720/yiNTRACELLULAR PROTEIN TRANSPORT PROTEIN US01 .//2.90E-1 7//41 7aay/24%//P25386 
TESTI20079510//NEURAL CELL ADHESION MOLECULE, 140 KDAISOFORM PRECURSOR (N-CAM 140)(NCAM- 
140) (CD56 ANTIGEN).//0//723aa//96%//P13591 TESTI20080200//DPY-19 PROTEIN.//7.50E-114//621aa//38%// 

10 P34413 

TESTI20080330 
TESTI20081390 
TESTI20081440 
TESTI20082340 

15 TESTI20082400 
TESTI20083430 

TESTI20083870//CALCINEURIN B-LIKE PROTEIN (CBLP).//1 .80E-73//169aa//82%//P28470 
TESTI20084400 

TESTI20086570//MELANOMA-ASSOCIATED ANTIGEN B4 (MAGE-B4 ANTIGEN).//4.20E-88//347aa//52%//015481 
20 TESTI20087740 
TESTI20088470 
TESTI201 36910 

TESTI20138320//Xenopus laevis transketolase mRNA, complete cds.//1 .30E-128//315aa//75%//AF270484 
TESTI20140360//XAA-PRO DIPEPTIDASE (EC3.4. 13.9) (X-PRO DIPEPTIDASE) (PROLINE DIPEPTIDASE) (PRO- 
25 LIDASE) (IMIDODIPEPTIDASE).//1.40E-55//111aa//98%//P12955 
TESTI201 77400 

TESTI30000020//L,mexicana Imsap2 gene for secreted acid phosphatase 2 (SAP2).//2,00E-11//361aa//24%//Z46970 
THYMU10000020//Homo sapiens mRNA for Golgi protein {GPP34 gene).//2.20E-135//274aa//95%//AJ296152 
THYIVIU 10000320 

30 THYIV1U10000830//SUCCINATE DEHYDROGENASE [UBIQUINONE] FLAVOPROTEIN SUBUNIT, iVIITOCHONDRI- 
AL PRECURSOR (EC 1 .3.5.1) (FP) (FLAVOPROTEIN SUBUNIT OF COIVIPLEX ll).//7.30E-84//185aa//87%//P31040 
THYMU10001050 

THYMU10001760//SIALOADHESIN PRECURSOR (SER).//2.50E-42//127aa//71%//Q62230 
THYMU1 000291 0//Homo sapiens AP-4 adaptor complex beta4 subunit mRNA, complete cds. //1 .30E-64//1 29aa// 
35 97%//AF092094 
THYIVIU 10003290 

THYIVIU10003590//BETA-CHIIVIAERIN//2.00E-23//200aa//31%//P52757 
THYIVIU 10003660 
THYIVIU 10003820 

40 THYIVIU10004590//T-CELL-SPECIFiC TRANSCRIPTION FACTOR 1 (TCF-1) (T-CELL FACTOR 1) (TRANSCRIP- 
TION FACTOR-7).//3.70E-89//1 72aa//97%//Q0041 7 
THYIVIU 10004730 

THYIVIU 1 000491 0//Homo sapiens protein serine/threonine phosphatase 4 regulatory subunit 1 (PP4R1) mRNA, com- 
plete cds.//3.70E-49//1 44aa//65%//AF1 11 1 06 
45 THYMU 10005270 

THYMU10005580//Homo sapiens Sec22 homolog mRNA, complete cds.//7.00E-139//264aa//98%//AF1 00749 

THYMU20001400 

THYMU20002360 

THYMU20003170//Homo sapiens topoisomerase II alpha-4 (TOP2A) mRNA, partial cds.//5.80E-09//92aa//42%// 
50 AF285159 

THYMU20003690//MUS musculus syntrophln-assoclated serine-threonine protein kinase mRNA, complete cds.// 

3.20E-1 89//481 aa//73%//AF07781 8 
TRACH1 00001 80 

TRACH1 0000300//Anabaena PCC7120 hetC gene, complete cds://7.00E-12//200aa//29%//U55386 
55 TRACH1 0000570 

TRACH10000630//CDIVI PROTEIN (6C6-AG TUMOR-ASSOCIATED ANTIGEN) (DXS1357E).//5.00E-1 24//246aa// 
100%//P51572 

TRACH1 0000740//lg delta chain (WIE)//6.90E-251//513aa//90%//S17597 
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TRACH1 0001 000//Oryctolagus cuniculus PiUS mRNA, complete cds.//6.50E-33//68aa//95%//U74297 
TRACH1 0001 060 

TRACH1 0001250//lg delta chain (WIE)//1 .60E-233//513aa//95%//S17597 
TRACH1 0001 400 
5 TRACH20000150//DPP2C1//4.70E-05//152aa//30%//AAC28998 

TRACH20000790//Xenopus laevis Churchill protein mRNA, complete cds.//4.10E-47//112aa//71%//AF238862 

TRACH20001850 
TRACH20001960 
TRACH20002350 

10 TRACH20002370//ZINC FINGER PROTEIN 1 84 (FRAGMENT)y/5.40E-61//275aa//38%//Q99676 
TRACH20002500//HYPOTHETICAL 65.2 KDATRP-ASP REPEATS CONTAINING PROTEIN 
D2030.9 IN CHROMOSOME I.//1 .10E-82//375aa//42%//P90794 

TRACH20002890//GROWTH FACTOR RECEPTOR-BOUND PROTEIN 7(GRB7 ADAPTER PROTEIN) (EPIDERMAL 
GROWTH FACTOR RECEPTOR GRB-7).//3.90E-188//346aa//98%//Q03160 
15 TRACH2000393Q//Rattus non/egicus matrin 3 mRNA, complete cds.//4.40E-1 92//371 aa//96%//M63485 
TRACH20004110 

TRACH2000420Q//MAJOR SURFACE-LABELED TROPHOZOITE ANTIGEN PRECURSOR.//4.40E-06//96aa//33%// 
P21849 

TRACH20004610 

20 TRACH20004720//1-AMINOCYCLOPROPANE-1-CARBOXYLATE SYNTHASE 2 (EC 4.4.1 .14) (ACC SYNTHASE 2) 
(S-ADENOSYL-L-METHIONINE METHYLTHIOADENOSINE-LYASE 2) (ACS-2).//5.40E-56//396aa//31%//P18485 
TRACH20004960//Rattus norvegicus kidney-specific protein (KS) mRNA, complete cds.//2.90E-282//573aa//88%// 
AF062389 

TRACH20004970/mansacylases//7.70E-1 1//1 51 aa//32%//AAB94954 
25 TRACH20006650//Rattus norvegicus mRNA for putative integral membrane transport protein (UST1 r).//3.30E-1 87// 
552aa//62%//Y09945 

TRACH200Q6750//RADIAL SPOKE PROTEIN 3.//1 .90E-50//156aa//48%//P12759 

TRACH20007670 

TRACH200Q7aoo//Homo sapiens PTH-responsive osteosarcoma B1 protein (B1) mRNA, complete cds.//8.40E-230// 

30 339aa//98%//AF095771 

TRACH20008940//PROTEIN TSG24 (MEIOTIC CHECK POINT REGULAT0R).//1 .50E-202//384aa//97%//P53995 
TRACH20008980 

TRACH20009260//PROBABLE OXIDOREDUCTASE EPHD (EC 1.-.-.-).//5.80E-22//201aa//33%//Q10402 
TRACH20009440//ZINC FINGER PROTEIN 91 (ZINC FINGER PROTEIN HTF10) (HPF7). //3.80E-07//89aa//39%// 
35 Q05481 

TRACH20011920 

TRACH20012890//SEX-LETHAL PROTEIN HOMOLOG.//1.20E-07//115aa//33%//017310 TRACH20013950//Homo 
sapiens NY-REN-25 antigen mRNA, partial cds.//2.20E-50//104aa//56%//AF155103 

TRACH20014000 
40 TRACH20015920 
TRACH20016070 
UMVEN10001220 

UMVEN20001330//RABPHILIN-3A.//2.00E-18//272aa//30%//P47709 

UTERU10000770//GAMETOGENESIS EXPRESSED PROTEIN GEG-154.//5.30E-33//73aa//97%//P50636 
45 UTERU1 0000960/A/A ATP SYNTHASE SUBUNIT H (EC 3.6.1 .34) (V-ATPASE H SUBUNIT) (V- ATPASE M9.2 SUB- 
UNIT) (9.2 KDA MEMBRANE ACCESSORY 
PR0TEIN).//1 .60E-1 5//68aa//51 %//01 5342 

UTERU10001600//ZINC FINGER PROTEIN 191.//3.20E-90//346aa//52%//014754 UTERU1 0001 920 
UTERU20000470//Homo sapiens neuropilin-2(a0) mRNA, complete cds.//5.00E-20//61aa//80%//AF022859 
50 UTERU20003380 
UTERU20003930 

UTERU20004850//X-linl<ed retinopathy protein//1 .1 OE-1 6//97aa//51 %//A4601 0 

UTERU20005410 

UTERU20005690 



203 



EP 1 293 569 A2 



1 . A polynucleotide selected from the group consisting of the following (a) to (g): 

5 (a), a polynucleotide comprising a protein-coding region of the nucleotide sequence of any one of SEQ ID 

NOs: 1 to 1639; 

(b) a polynucleotide encoding a polypeptide comprising the amino acid sequence of any one of SEQ ID NOs: 
1640 to 3278; 

(c) a polynucleotide comprising a nucleotide sequence encoding a polypeptide comprising the amino acid 
10 sequence of any one of SEQ ID NOs: 1 640 to 3278, wherein, in said amino acid sequence, one or more amino 

acids have been substituted, deleted, inserted, and/or added, and wherein said nucleotide sequence encodes 
a polypeptide functionally equivalent to a polypeptide comprising the selected amino acid sequence; 

(d) a polynucleotide hybridizing to a polynucleotide comprising the nucleotide sequence of any one of SEQ 
ID NOs: 1 to 1639, wherein said nucleotide sequence encodes a polypeptide functionally equivalent to a 

15 polypeptide encoded by the selected nucleotide sequence; 

(e) a polynucleotide comprising a nucleotide sequence encoding a partial amino acid sequence of a polypep- 
tide encoded by the polynucleotide according to any one of (a) to (d); 

(f) a polynucleotide comprising a nucleotide sequence having at least 70% identity to the nucleotide sequence 
of any one of SEQ ID NOs: 1 to 1 639; and 

20 (g) a polynucleotide comprising a nucleotide sequence having at least 90% identity to the nucleotide sequence 

of any one of SEQ ID NOs: 1 to 1639. 

2. A polypeptide encoded by the polynucleotide of claim 1 , or a partial peptide thereof. 
25 3. An antibody binding to the polypeptide or the peptide of claim 2. 

4. A method for immunologically assaying the polypeptide or the peptide of claim 2, said method comprising the steps 
of contacting the polypeptide or the peptide of claim 2 with the antibody of claim 3, and observing the binding 
between the two. 

30 

5. A vector comprising the polynucleotide of claim 1 . 

6. A transformant carrying the polynucleotide of claim 1 or the vector of claim 5. 

35 7. A transformant carrying the polynucleotide of claim 1 or the vector of claim 5 in an expressible manner. 

8. A method for producing the polypeptide or the peptide of claim 2, said method comprising the steps of culturing 

the transformant of claim 7 and recovering an expression product. 

40 9. An oligonucleotide comprising at least 15 nucleotides, said oligonucleotide comprising a nucleotide sequence 
complementary to the nucleotide sequence of any one of SEQ ID NOs: 1 to 1 639 or to a complementary strand 
thereof. 

10. Use of the oligonucleotide of claim 9 as a primer for synthesizing the polynucleotide of claim 1. 

45 

11. Use of the oligonucleotide of claim 9 as a probe for detecting the polynucleotide of claim 1. 

12. An antisense polynucleotide against the polynucleotide of claim 1 or a part thereof. 

50 13. A method for detecting the polynucleotide of claim 1 , said method comprising the following steps of: 

a) incubating a target polynucleotide with the oligonucleotide of claim 9 under hybridizable conditions, and 

b) detecting hybridization of the target polynucleotide with the oligonucleotide of claim 9. 

55 14. A database of polynucleotides and/or polypeptides, said database comprising infonnatjon on at least one of the 
nucleotide sequences of SEQ ID NOs: 1 to 1639 and/or on at least one of the amino acid sequences of SEQ ID 
NOs: 1640 to 3278. 
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Figure 1 
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